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(57) The invention describes a new method to se- 
quence DNA. The improvements over the existing DNA 
sequencing technologies are high speed, high through- 
put, no electrophoresis and gel reading artifacts due to 
the complete absence of an electrophoretic step, and 
no costly reagents involving various substitutions with 
stable isotopes. The invention utilizes the Sanger se- 
quencing strategy and assembles the sequence infor- 
mation by analysis of the nested fragments obtained by 
base-specific chain termination via their different molec- 
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ular masses using mass spectrometry, as for example, 
MALDI or ES mass spectrometry. A further increase in 
throughput can be obtained by introducing mass-modi- 
fications in the oligonucleotide primer, chain-terminating 
nucleoside triphosphates and/or in the chain-elongating 
nucleoside triphosphates, as well as using integrated 
tag sequences which allow multiplexing by hybridization 
of tag specific probes with mass-differentiated molecu- 
lar weights. 
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Description 

Background of the Invention 

s [0001] Since the genetic Information is represented by the sequence of the four DNA building blocks deoxyadenosine- 
(dpA), deoxyguanosine- (dpG), deoxycytldlne-(dpC) and deoxythymidine-S'-phosphate (dpT), DNA sequencing is one 
of the most fundamental technologies in molecular biology and the life sciences in general. The ease and the rate by 
which DNA sequences can be obtained greatly affects related technologies such as development and production of 
new therapeutic agents and new and useful varieties of plants and microorganisms via recombinant DNA technology. 

10 in particular, unraveling the DNA sequence helps in understanding human pathological conditions including genetic 
disorders, cancer and AIDS. In some cases, very subtle differences such as a one nucleotide deletion, addition or 
substitution can create serious, in some cases even fatal, consequences. Recently, DNA sequencing has become the 
core technology of the Human Genome Sequencing Project (e.g., J.E. Bishop and M. Waldholz, 1991, Genome: The 
Story of the Most Astonishing Scientific Adventure of Our Time - The Attempt to Map All the Genes in the Human Body , 

is Simon & Schuster, New York). Knowledge of the complete human genome DNA sequence will certainly help to under- 
stand, to diagnose, to prevent and to treat human diseases. To be able to tackle successfully the determination of the 
approximately 3 billion base pairs of the human genome in a reasonable time frame and in an economical way, rapid, 
reliable, sensitive and inexpensive methods need to be developed, which also offer the possibility of automation. The 
present invention provides such a technology. 

20 [0002] Recent reviews of today's methods together with future directions and trends are given by Barrel! (The FASEB 
Journal 5, 40-45 (1991)), andTrainor (Anal. Chem. 62, 418-26(1990)). 

[0003] Currently, DNA sequencing is performed by either the chemical degradation method of Maxam and Gilbert 
(Methods in Enzymology 65, 499-560 (1980)) or the enzymatic dideoxynucleotide termination method of Sanger et at. 
( P roc. Natl. Acad. Sci. USA 74, 5463-67 (1977)). In the chemical method, base specific modifications result in a base 

25 specific cleavage of the radioactive or fluorescently labeled DNA fragment With the four separate base specific cleav- 
age reactions, four sets of nested fragments are produced which are separated according to length by pofyacrylamlde 
gel electrophoresis (PAGE). After autoradiography, the sequence can be read directly since each band (fragment) in 
the gel originates from a base specific cleavage event. Thus, the fragment lengths in the four "ladders" directly translate 
into a specific position in the DNA sequence. 

30 [0004] In the enzymatic chain termination method, the four base specific sets of DNA fragments are formed by starting 
with a primer/template system elongating the primer into the unknown DNA sequence area and thereby copying the 
template and synthesizing a complementary strand by DNA polymerases, such as Klenow fragment of E. coti DNA 
polymerase I, a DNA polymerase from Thermus aquaticus, Taq DNA polymerase, or a modified T7 DNA polymerase, 
Sequenase (Tabor era/., Proc. Natl. Acad.Sci.USA 84, 4767-4771 (1987)), in the presence of chain-terminating rea- 

33 gents. Here, the chain-terminating event is achieved by incorporating into the four separate reaction mixtures in addition 
to the four normal deoxy nucleoside triphosphates, dATP, dGTP, dTTP and dCTP, only one of the chain-terminating 
dJdeoxy nucleoside triphosphates. ddATP, ddGTP, ddTTP orddCTP, respectively, in a limiting small concentration. The 
four sets of resulting fragments produce, after electrophoresis, four base specific ladders from which the DNA sequence 
can be determined. 

40 [0005] A recent modification of the Sanger sequencing strategy involves the degradation of phosphorothioate-con- 
taining DNA fragments obtained by using afpha-thio dNTP instead of the normally used ddNTPs during the primer 
extension reaction mediated by DNA polymerase (Labeit at at., DNA 5, 1 73-1 77 (1 986); Amersham, PCT-Application 
GB86/0O349; Eckstein et at., Nucleic Acids Res . 16, 9947 (1988)). Here, the four sets of base-specific sequencing 
ladders are obtained by limited digestion with exonuclease III or snake venom phosphodiesterase, subsequent sepa- 

4* ration on PAGE and visualization by radioisotopic labeling of either the primer or one of the dNTPs. In a further mod- 
ification, the base-specific cleavage is achieved by alkylating the sulphur atom in the modified phosphodiester bond 
followed by a heat treatment (Max-Planck-Gesellschaft, DE 3930312 A1). Both methods can be combined with the 
amplification of the DNA via the Polymerase Chain Reaction (PCR). 

[0006] On the upfront end, the DNA to be sequenced has to be fragmented into sequencable pieces of currently not 
50 more than 500 to 1000 nucleotides. Starting from a genome, this is a multi-step process involving cloning and subcloning 
steps using different and appropriate cloning vectors such as YAC, cosmlds, plasmids and M13 vectors (Sambrook et 
al„ Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, 1989). Finally, for Sanger sequenc- 
ing, the fragments of about 500 to 1 000 base pairs are integrated into a specific restriction site of the replicatlve form 
I (RF I) of a derivative of the M13 bacteriophage (Vieria and Messing, Gene 19, 259 (1982)) and then the double- 
55 stranded form is transformed to the single-stranded circular form to serve as a template for the Sanger sequencing 
process having a binding site for a universal primer obtained by chemical DNA synthesis (Sinha, Biernat, McManus 
and Koster, Nucleic Adds Res. 12, 4539-57 (1 984); U.S. Patent No. 4725677 upstream of the restriction site into which 
the unknown DNA fragment has been inserted. Under specific conditions, unknown DNA sequences integrated into 
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supercoiled double-stranded plasmid DNA can be sequenced directly by the Sanger method (Chen and Seeburg, DNA 
4, 165-170 (1985)) and Lim etat, Gene Anal. Techn, 5, 32-39 (1988), and, with the Polymerase Chain Reaction (PCR) 
( PCR Protocols: A Guide to Methods and Applications . Innis eta]., editors, Academic Press, San Diego (1 990)) cloning 
or subcloning steps could be omitted by directly sequencing off chromosomal DNA by first amplifying the DNA segment 
5 by PCR and then applying the Sanger sequencing method (Innis ef at, Proc. Natl. Acad. Sci. USA 85 , 9436- 9440 
(1988)). In this case, however, the DNA sequence in the interested region most be known at least to the extent to bind 
a sequencing primer. 

[0007] In order to be able to read the sequence from PAGE, detectable labels have to be used In either the primer 
(very often at the 5' -end) or in one of the deoxy nucleoside triphosphates, dNTP. Using radioisotopes such as ^P, 33 P. 

10 or 35S is still the most frequently used technique. After PAGE, the gels are exposed to X-ray films and silver grain 
exposure is analyzed. The use of radioisotopic labeling creates several problems. Most labels useful for autoradio- 
graphic detection of sequencing fragements have relatively short half-lives which can limit the useful time of the labels. 
The emission high energy beta radiation, particularly from 32 P i can lead to breakdown of the products via radio lysis so 
that the sample should be used very quickly after labeling, in addition, high energy radiation can also cause a deteri- 

'5 oration of band sharpness by scattering. Some of these problems can be reduced by using the less energetic isotopes 
such as 33 P or 35 S (see, e.g., Ornstein ef a/., Biotechniques 3, 476 (1985)). Here, however, longer exposure times 
have to be tolerated. Above all, the use of radioisotopes poses significant health risks to the experimentalist and, in 
heavy sequencing projects, decontamination and handling the radioactive waste are other severe problems and bur- 
dens. 

?0 [0008] In response to the above mentioned problems related to the use of radioactive labels, non-radioactive labeling 
techniques have been explored and, in recent years, Integrated into partly automated DNA sequencing procedures. 
All these improvements utilize the Sanger sequencing strategy. The fluorescent label can be tagged to the primer 
(Smith et aft, Nature 321 , 674-679 (1986) and EPO Patent No. 87300998.9; Du Pont De Nemours EPO Application 
No. 0359225; Ansorge et a/. J. Biochem. Biophys. Methods 13 , 325-32 (1 986)) or to the chain -terminating dideoxynu- 

25 cioside triphosphates (Prober etaf. Science 238 , 336-41 (1 987); Applied Biosystems, PCT Application WO 91/05060). 
Based on either labeling the primer or the ddNTP, systems have been developed by Applied Biosystems (Smith et at, 
Science 235 , G89 (1987); U.S. Patent Nos. 570973 and 689013), Du Pont De Nemours (Prober et at Science 238 , 
336-341 (1987); U.S. Patents Nos. 881372 and 57566), Phanmacia-LKB (Ansorge ef at Nucleic Acids Res , 15, 
4593-4602 (1987) and EMBL Patent Application DE P3724442 and P3805808.1) and Hitachi (JP 1-90844 and DE 

30 4011991 A1). A somewhat similar approach was developed by Brumbaugh et at (Proc. Natl. Sci. USA B5 , 5610-14 
(1988) and U.S. Patent No. 4,729,947). An improved method for the Du Pont system using two electrophoretic lanes 
with two different specific labels per lane Is described (PCT Application WO92/02635). A different approach uses flu- 
orescentfy labeled avldin and biotln labeled primers. Here, the sequencing ladders ending with biotin are reacted during 
electrophoresis with the labeled avidin which results in the detection of the individual sequencing bands (Brumbaugh 

35 etat, U.S. Patent No. 594676). 

[0009] More recently even more sensitive non-radioactive labeling techniques for DNA using chemiluminescence 
triggerable and amplifyable by enzymes have been developed (Beck, O'Keefe, Couil and Koster, Nucleic Acids Res. 
17 , 51 1 5-51 23 (1989) and Beck and Koster, Anal. Chem. 62 , 2258-2270 (1 990)). These labeling methods were com- 
bined with multiplex DNA sequencing (Church etat Science 240, 185-188 (1988) to provide for a strategy aimed at 

40 high throughput DNA sequencing (Koster ef a/., Nucleic Acids Res. Symposium Ser. No. 24, 31 8-321 (1 991 ), University 
of Utah, PCT Application No. WO 90/15883); this strategy still suffers from the disadvantage of being very laborious 
and difficult to automate. 

[0010] in an attempt to simplify DNA sequencing, solid supports have been introduced. In most cases published so 
far, the template strand for sequencing (with or without PCR amplification) is immobilized on a solid support most 

^5 frequently utilizing the strong biotin-avidin/streptavidln interaction (Orion-Yhtyma Oy» U.S. Patent No. 277643; M. Uhlen 
ef a/. Nucleic Acids Res. 16, 3025-38 (1988); Cemu Bioteknik, PCT Application No. WO 89/09282 and Medical Re- 
search Council, GB, PCT Application No. WO 92/03575). The primer extension products synthesized on the immobi- 
lized template strand are purified of enzymes, other sequencing reagents and byproducts by a washing step and then 
released under denaturing conditions by loosing the hydrogen bonds between the Watson-Crick base pairs and sub- 

so jected to PAGE separation. In a different approach, the primer extension products (not the template) from a DNA 
sequencing reaction are bound to a solid support via biotin/avldin (Du Pont De Nemours, PCT Application WO 
91/11533). In contrast to the above mentioned methods, here, the interaction between biotln and avidin is overcome 
by employing denaturing conditions (formamide/EDTA) to release the primer extension products of the sequencing 
reaction from the solid support for PAGE separation. As solid supports, beads, (e.g., magnetic beads (Dynabeads) 

55 and Sepharose beads), filters, capillaries, plastic dipsticks (e.g., polystyrene strips) and microliter wells are being 
proposed. 

[0011] All methods discussed so far have one central step in common: 

pofyacrylamide gel electrophoresis (PAGE). In many instances, this represents a major drawback and limitation for 
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each of these methods. Preparing a homogeneous gel by polymerization, loading of the samples, the electrophoresis 
itself, detection of the sequence pattern (e.g., by autoradiography), removing the gel and cleaning the glass plates to 
prepare another gel are very laborious and time-consuming procedures. Moreover, the whole process Is error-prone, 
difficult to automate, and, in order to improve reproducibility and reliability, highly trained and skilled personnel are 

5 required. In the case of radioactive labeling, autoradiography itself can consume from hours to days. In the case of 
fluorescent labeling, at least the detection of the sequencing bands is being performed automatically when using the 
laser-scanning devices integrated into commercial available DNA sequencers. One problem related to the fluorescent 
labeling Is the influence of the four different base-specific fluorescent tags on the mobility of the fragments during 
electrophoresis and a possible overlap in the spectral bandwidth of the four specific dyes reducing the discriminating 

10 power between neighboring bands, hence, increasing the probability of sequence ambiguities. Artifacts are also pro- 
duced by base-specific interactions with the polyacrylamide gel matrix (Frank and Koster, Nucleic Acids Res. 6, 2069 
(1979)) and by the formation of secondary structures which result in "band compressions" and hence do not allow one 
to read the sequence. This problem has, in part, been overcome by using 7-deazadeoxyguanosine triphosphates (Barr 
et a/., Biotechniques 4, 428 (1986)). However, the reasons for some artifacts and conspicuous bands are still under 

is investigation and need further improvement of the gel electrophoretic procedure. 

[0012] A recent innovation in electrophoresis is capillary zone electrophoresis (CZE) (Jorgenson et al., J. Chroma- 
tography 352 , 337 (19B6); Gesteland et a/., Nucleic Acids Res. 18, 1415-1419 (1990)) which, compared to slab gel 
electrophoresis (PAGE), significantly increases the resolution of the separation, reduces the time for an electrophoretic 
run and allows the analysis of very small samples. Here, however, other problems arise due to the miniaturization of 

20 the whole system such as wall effects and the necessity of highly sensitive on-line detection methods. Compared to 
PAGE, another drawback is created by the fact that CZE is only a "one-lane" process, whereas in PAGE samples in 
multiple lanes can be electro phoresed simultaneously. 

[0013] Due to the severe limitations and problems related to having PAGE as an integral and central part in the 
standard DNA sequencing protocol, several methods have been proposed to do DNA sequencing without an electro- 
ns phoretic step. One approach calls for hybridization or fragmentation sequencing (Bains, Biotechnology 10 , 757-58 
(1992) and Mirzabekov et a/. f FEBS Letters 256 , 11B-122 (1989)) utilizing the specific hybridization of known short 
oligonucleotides (e.g., octadeoxynucleotides which gives 65,536 different sequences) to a complementary DNA se- 
quence. Positive hybridization reveals a short stretch of the unknown sequence. Repeating this process by performing 
hybridizations with all possible octadeoxynucleotides should theoretically determine the sequence. In a completely 
30 different approach, rapid sequencing of DNA is done by unilaterally degrading one single, immobilized DNA fragment 
by an exonuciease in a moving flow stream and detecting the cleaved nucleotides by their specific fluorescent tag via 
laser excitation (Jett era/., J. Biomolecular Structure & Dynamics 7, 301-309, (1989); United States Department of 
Energy, PCT Application No. WO 89/03432). In another system proposed by Hyman (Anal. Biochem. 174, 423-436 
(1988)), the pyrophosphate generated when the correct nucleotide is attached to the growing chain on a primer-tem- 
3s plate system is used to determine the DNA sequence. The enzymes used and the DNA are held in place by solid 
phases (DEAE-Sepharose and Sepharose) either by ionic interactions or by covalent attachment. In a continuous flow- 
through system, the amount of pyrophosphate is determined via biofuminescence (luciferase). A synthesis approach 
to DNA sequencing is also used by Tsien et al. (PCT Application No. WO 91/06678). Here, the incoming dNTPs are 
protected at the 3*-end by various blocking groups such as acetyl or phosphate groups and are removed before the 
40 next elongation step, which makes this process very slow compared to standard sequencing methods. The template 
DNA is immobilized on a polymer support. To detect incorporation, a fluorescent or radioactive label is additionally 
incorporated into the modified dNTP's. The same patent application also describes an apparatus designed to automate 
the process. 

[001 4] Mass spectrometry, in general, provides a means of "weighing" Individual molecules by Ionizing the molecules 
45 in vacuo and making them "fly" by volatilization. Under the influence of combinations of electric and magnetic fields, 
the ions follow trajectories depending on their individual mass (m) and charge (z). In the range of molecules with low 
molecular weight, mass spectrometry has long been part of the routine physical-organic repertoire for analysis and 
characterization of organic molecules by the determination of the mass of the parent molecular ion. In addition, by 
arranging collisions of this parent molecular ion with other particles (e.g., argon atoms), the molecular ion is fragmented 
so forming secondary ions by the so-called collision induced dissociation (CID). The fragmentation pattern/pathway very 
often allows the derivation of detailed structural information. Many applications of mass spectrometry methods in the 
known in the art, particularly in biosciences, and can be found summarized in Methods in Enzymology , Vol. 1 93: "Mass 
Spectrometry" (J.A. McCloskey, editor), 1990, Academic Press, New York, 

[0015] Due to the apparent analytical advantages of mass spectrometry in providing high detection sensitivity, ac- 
55 curacy of mass measurements, detailed structural information by CID in conjunction with an MS/MS configuration and 
speed, as well as on-line data transfer to a computer, there has been considerable interest in the use of mass spec- 
trometry for the structural analysis of nucleic acids. Recent reviews summarizing this field include K. H. Schram, "Mass 
Spectrometry of Nucleic Acid Components, Biomedical Applications of Mass Spectrometry" 34, 203-2B7 (1990); and 
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P.F. Crain, "Mass Spectrometry Techniques in Nucleic Acid Research ,* Mass Spectrometry Reviews 9, 505-554 (1 990). 
The biggest hurdle to applying mass spectrometry to nucleic acids is the difficulty of volatilizing these very polar bt- 
opoJymers. Therefore, "sequencing" has been limited to low molecular weight synthetic oligonucleotides by determining 
the mass of the parent molecular ion and through this, confirming the already known sequence, or alternatively, con- 
s firming the known sequence through the generation of secondary ions (fragment Ions) via CID in an MS/MS configu- 
ration utilizing, in particular, for the ionization and volatilization, the method of fast atomic bombardment (FAB mass 
spectrometry) or plasma desorption (PD mass spectrometry). As an example, the application of FAB to the analysis 
of protected dimeric blocks for chemical synthesis of oligodeoxy nucleotides has been described (Kdster etaf. Biomed- 
ical Environmental Mass Spectrometry 14 , 111-116(1987)). 

10 [0016] Two more recent ionlzation/desorption techniques are electroepray/ionspray (ES) and matrix-assisted laser 
desorption/lonlzatlon (MALDI). ES mass spectrometry has been introduced by Fenn et at. (J.Phys.Chem . 88 , 4451-59 
(1984); PCT Application No. WO 90/14148) and current applications are summarized in recent review articles (R.D. 
Smith era/., Anal. Chem. 82 , 882-89 (1990) and B. Ardrey, Electrospray Mass Spectrometry, Spectroscopy Europe , 
4, 10-18 (1992)). The molecular weights of the tetradecanucleotide d(CATGCCATGGCATG) (SEQ ID NO:1) (Covey 

*5 et at. The Determination of Protein, Oligonucleotide and Peptide Molecular Weights by lonspray Mass Spectrometry," 
Rapid Communications in Mass Spectrometry , 2, 249-256 (1988)), of the 21-mer d(AAATTGTGCACATCCTGCAGC) 
(SEQ ID NO:2) and without giving details of that of a tRNA with 76 nucleotides ( Methods In Enzymology , 193 , "Mass 
Spectrometry 1 ' (McCloskey, editor), p. 425, 1 990, Academic Press, New York) have been published. As a mass analyzer, 
a quadrupole is most frequently used. The determination of molecular weights in femtomole amounts of sample is very 

20 accurate due to the presence of multiple ion peaks which afl could be used for the mass calculation. 

[0017] MALDI mass spectrometry, in contrast, can be particularly attractive when a time-of-f light (TOF) configuration 
is used as a mass analyzer. The MALDI-TOF mass spectrometry has been introduced by Hillenkamp et af. ("Matrix 
Assisted UV-Laser Desorptlon/lonization: A New Approach to Mass Spectrometry of Large Biomolecules," Biological 
Mass Spectrometry (Burlingame and McCloskey, editors), Elsevier Science Publishers, Amsterdam, pp. 49-60, 1990.) 

25 Since, in most cases, no multiple molecular ion peaks are produced with this technique, the mass spectra, in principle, 
look simpler compared to ES mass spectrometry. Although DNA molecules up to a molecular weight of 41 0,000 daltons 
could be desorbed and volatilized (Williams et at., "Volatilization of High Molecular Weight DNA by Pulsed Laser Ablation 
of Frozen Aqueous Solutions," Science, 246 , 1585-87 (1989)), this technique has so far only been used to determine 
the molecular weights of relatively small oligonucleotides of known sequence, e.g., oligothymidyiic acids up to 18 

30 nucleotides (Huth-Fehre et at. t "Matrix-Assisted Laser Desorption Mass Spectrometry of Oligodeoxythymidylic Acids," 
Rapid Communications in Mass Spectrometry , 6, 209-1 3 (1 992)) and a double-stranded DNA of 28 base pairs (Williams 
et at., "Tlme-of- Flight Mass Spectrometry of Nucleic Acids by Laser Abfation and Ionization from a Frozen Aqueous 
Matrix,* Rapid Communications in Mass Spectrometry , 4, 348-351 (1 990)). In one publication (Huth-Fehre etai., 1 992 , 
supra), it was shown that a mixture of all the oligothymidyiic acids from n=12 to n=18 nucleotides coukJ be resolved. 

33 [0018] In U.S. Patent No. 5,064,754, RNA transcripts extended by DNA both of which are complementary to the 
DNA to be sequenced are prepared by incorporating NTP's, dNTP's and, as terminating nucleotides, ddNTP's which 
are substituted at the 5-position of the sugar moiety with one or a combination of the 
isotopes 12 C. 13 C, 14 C, 1 H, 2 H, 3 H, te O, 17 0 and 18 0. The polynucleotides obtained are degraded to S'-nucleotides, 
cleaved at the N-glycosidic linkage and the isotopicaify labeled 5' -functionality removed by pertodate oxidation and the 

to resulting formaldehyde species determined by mass spectrometry. A specific combination of isotopes serves to dis- 
criminate base-specifically between internal nucleotides originating from the incorporation of NTP's and dNTP's and 
terminal nucieotides caused by linking ddNTP's to the end of the polynucleotide chain. A series of RNA/DNA fragments 
is produced, and in one embodiment, separated by electrophoresis, and, with the aid of the so-caJJed matrix method 
of analysis, the sequence is deduced. 

49 [0019] In Japanese Patent No. 59-131909, an instrument is described which detects nucleic acid fragments sepa- 
rated either by electrophoresis, liquid chromatography or high speed gel filtration. Mass spectrometric detection is 
achieved by incorporating into the nucleic acids atoms which normally do not occur in DNA such as S, Br, I or Ag, Au, 
Pt, Os, Hg. The method, however, is not applied to sequencing of DNA using the Sanger method. In particular, it does 
not propose a base-specific correlation of such elements to an individual ddNTP. 

« [0020] PCT Application No. WO 89/1 2694 (Brennan et at. , Proc. SPIE-lnL Soc. Opt. Eng. 1206, (NewTechnoi.Cytom. 
Mol. Biol ), pp. 60-77 (1990); and Brennan, U.S. Patent No. 5,003,059) employs the Sanger methodology for DNA 
sequencing by using a combination of either the four stable isotopes ^S, 33 S, ^S, 36 S or ^Cl, 37 CI, 79 Br, 81 Br to spe- 
cifically label the chain-terminating ddNTP's. The sulfur isotopes can be located either in the base or at the alpha- 
position of the triphosphate moiety whereas the halogen isotopes are located either at the base or at the 3'-posltion of 

55 the sugar ring. The sequencing reaction mixtures are separated by an electrophoretic technique such as CZE, trans- 
ferred to a combustion unit in which the sulfur isotopes of the incorporated ddNTP's are transformed at about 900°C 
in an oxygen atmosphere. The S0 2 generated with masses of 64, 65, 66 or 68 is determined on-line by mass spec- 
trometry using, e.g., as mass analyzer, a quadrupole with a single ion-multiplier to detect the ion current. 
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[0021] A similar approach is proposed in U.S. Patent No. 5,002,868 (Jacobson et at. , Proc. SPIE-Jnt, Soc f Opt, Eng. 
1435, (Opt. Methods Ultrasensitive Detect. Anal. Tech. Appl.) , 26-35 (1991)) using Sanger sequencing with four 
ddNTP's specifically substituted at the alpha-position of the triphosphate moiety with one of the four stable sulfur iso- 
topes as described above and subsequent separation of the four sets of nested sequences by tube gel electrophoresis. 
5 The only difference is the use of resonance ionization spectroscopy (RIS) in conjunction with a magnetic sector mass 
analyzer as disclosed in U.S. Patent No. 4,442,354 to detect the sulfur isotopes corresponding to the specific nucleotide 
terminators, and by this, allowing the assignment of the DNA sequence. 

[0022] EPO Patent Applications No. 0360676 A1 and 0360677 A1 also describe Sanger sequencing using stable 
isotope substitutions in the ddNTP's such as D, ,3 C, 15 N, 17 0, 1fi O, 32 S, ^S, 34 S, ^S, 19 F, 35 CI, 37 CI, 79 Br, 81 Br 

10 and 127 l or functional groups such as CF 3 or Si(CH 3 ) 3 at the base, the sugar or the alpha position of the triphosphate 
moiety according to chemfcaf functionality. The Sanger sequencing reaction mixtures are separated by tube gel elec- 
trophoresis. The effluent is converted into an aerosol by the etectrospray/thermospray nebulizer method and then 
atomized and ionized by a hot plasma (7000 to 8000° K) and analyzed by a simple mass analyzer. An instrument is 
proposed which enables one to automate the analysis of the Sanger sequencing reaction mixture consisting of tube 

'5 electrophoresis, a nebulizer and a mass analyzer. 

[0023] The application of mass spectrometry to perform DNA sequencing by the hybridization/fragment method (see 
above) has been recently suggested (Bains, "DNA Sequencing by Mass Spectrometry: Outline of a Potential Future 
Application," Chimicaoggi -9, 13*16(1991)). 



20 Summary of the Invention 



[0024] The invention describes a new method to sequence DNA. The improvements over the existing DNA sequenc- 
ing technologies include high speed, high throughput, no required electrophoresis (and, thus, no gel reading artifacts 
due to the complete absence of an electrophoretic step), and no costly reagents involving various substitutions with 
stable isotopes. The invention utilizes the Sanger sequencing strategy and assembles the sequence information by 
analysis of the nested fragments obtained by base-specific chain termination via their different molecular masses using 
mass spectrometry, for example, MALDI or ES mass spectrometry. A further increase in throughput can be obtained 
by introducing mass modifications in the oligonucleotide primer, the chain-terminating nucleoside triphosphates and/ 
or the chain-elongating nucleoside triphosphates, as we/I as using integrated tag sequences which allow multiplexing 
by hybridization of tag specific probes with mass differentiated molecular weights. 



Brief Description of the FIGURES 



[0025] 

FIGURE 1 is a representation of a process to generate the samples to be analyzed by mass spectrometry. This 
process entails insertion of a DNA fragment of unknown sequence into a cloning vector such as derivatives of 
M13, pUC or phagemids; transforming the double-stranded form into the single-stranded form; performing the four 
Sanger sequencing reactions; linking the base-specifically terminated nested fragment family temporarily to a solid 
support; removing by a washing step all by-products; conditioning the nested DNA or RNA fragments by, for ex- 
ample, cation- ion exchange or modification reagent and presenting the immobilized nested fragments either di- 
rectly to mass spectrometry analysis or cleaving the purified fragment family off the support and evaporating the 
cleavage reagent. 

FIGURE 2A shows the Sanger sequencing products using dcfTTP as terminating deoxynucleoside triphosphate 
of a hypothetical DNA fragment of 50 nucleotides (SEQ ID NO:3) in length with approximately equally balanced 
base composition. The molecular masses of the various chain terminated fragments are given. 
FIGURE 2B shows an idealized mass spectrum of such a DNA fragment mixture. 

FIGURES 3A and 3B show, In analogy to FIGURES 2A and 2B, data for the same model sequence (SEQ ID NO: 
3) with ddATP as chain terminator. 

FIGURES 4A and 4B show data, analogous to FIGURES 2A and 2B when ddGTP is used as a chain terminator 
for the same model sequence (SEQ ID NO:3). 

FIGURES 5A and 5B illustrate the results obtained where chain termination is performed with ddCTP as a chain 
terminator, in a similar way as shown in FIGURES 2A and 2B for the same model sequence (SEQ ID NO:3). 
FIGURE 6 summarizes the results of FIGURES 2A to 5B, showing the correlation of molecular weights of the 
nested four fragment families to the DNA sequence (SEQ ID NO:3). 

FIGURES 7A and 7B illustrate the general structure of mass-modified sequencing nucleic acid primers or tag 
sequencing probes for either Sanger DNA or Sanger RNA sequencing. 

FIGURES 8 A and 8B show the general structure for the mass-modified triphosphates for either Sanger DNA or 
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Sanger RNA sequencing. General formulas of the chain-elongating and the chain -terminating nucleoside triphos- 
phates are demonstrated. 

FIGURE 9 outlines various linking chemistries (X) with either polyethylene glycol or terminally monoalkylated pol- 
yethylene glycol (R) as an example. 

FIGURE 10 illustrates similar linking chemistries as shown in FIGURES 8A and 8B and depicts various mass 
modifying moieties (R). 

FIGURE 1 1 outlines how multiplex mass spectrometry sequencing can work using the mass-modified nucleic acid 
primer (UP). 

FIGURE 12 shows the process of multiplex mass spectrometry sequencing employing mass-modified chain-elon- 
gating and/or terminating nucleoside triphosphates. 

FIGURE 13 shows multiplex mass spectrometry sequencing by involving the hybridization of mass-modified tag 
sequence specific probes. 

FIGURE 14 shows a MALDI-TOF spectrum of a mixture of oligothymidylic acids, d(pT) 12-1 B. 

FIGURE 1 5 shows a superposition of MALDI-TOF spectra of the 50.^ dfTAACGGTCATTACGGCCATTGACTG- 

TAGGACCTGCATTACATGACTAGCT) (SEQ ID NO:3) (500 fmol) and oTfpdnQgQ (500 fmol). 

FIGURES 16A-1 6M show the MALDI-TOF spectra of all 13 DN A sequences representing the nested cTT-terminated 

fragments of the Sanger DNA sequencing simulation of Figure 2, 500 fmol each, as follows: 16A is a 7-mer; 16B 

is a 1 0-mer; 1 6C is a 11 -mer; 1 6D is a 1 9-mer; 16E is a 20-mer; 1 6F is a 24-mer; 1 6G is a 26-mer; 1 6H is a 33-mer; 

161 is a 37-mer; 16J is a 38-mer; 16K is a 42-mer; 16L is a 46-mer and 16M is a 50-mer. 

FIGURES 17A and 17B show the superposition of the spectra of FIGURE 1 6. The two panels show two different 
scales and the spectra analyzed at that scale. Figure 1 7A shows the superposition of the spectra of 16A-1 6F. The 
letter above each peak corresponds to the original spectra of the fragment in FIGURE 16. For example, peak B 
corresponds to FIGURE 16B; peak C corresponds to FIGURE 16C, etc. 

FIGURE 1 8 shows the superimposed MALDI-TOF spectra from MALDI-MS analysis of mass-modified oligonucle- 
otides as described in Example 21. 

FIGURE 19 illustrates various linking chemistries between the solid support (P) and the nucleic acid primer (NA) 
through a strong electrostatic interaction. 

FIGURES 20A and 20B Illustrate various linking chemistries between the solid support (P) and the nucleic acid 
primer (NA) through a charge transfer complex of a charge transfer acceptor (A) and a charge transfer donor (D). 
FIGURE 21 illustrates various linking chemistries between the solid support (P) and the nucleic acid primer (NA) 
through a stable organic radical. 

FIGURE 22 illustrates a possible linking chemistry between the solid support (P) and the nucleic acid primer (NA) 
through Watson-Crick base pairing. 

FIGURE 23 illustrates linking the solid support (P) and the nucleic acid primer (NA) through a phototyticaily cleav- 
able bond. 

Detailed Description of the Invention 

[0026] This invention describes an improved method of sequencing DNA. In particular, this invention employs mass 
spectrometry, such as matrix-assisted laser desorption/ionization (MALDI) or elect rospray (ES) mass spectrometry 
(MS), to analyze the Sanger sequencing reaction mixtures. 

[0027] In Sanger sequencing, four families of chain-terminated fragments are obtained. The mass difference per 
nucleotide addition is 289.19 for dpC, 313.21 for dpA, 329.21 for dpG and 304.2 for dpT, respectively. 
[0028] In one embodiment, through the separate determination of the molecular weights of the four base-specifteally 
terminated fragment families, the DNA sequence can be assigned via superposition (e.g., Interpolation) of the molecular 
weight peaks of the four individual experiments. In another embodiment, the molecular weights of the four specifically 
terminated fragment families can be determined simultaneously by MS, either by mixing the products of all four reactions 
run in at least two separate reaction vessels (i.e., all run separately, or two together, or three together) or by running 
one reaction having all four chain-terminating nucleotides (e.g., a reaction mixture comprising dTTP, ddTTP, dATP, 
ddATP, dCTP, ddCTP, dGTP, ddGTP) in one reaction vessel. By simultaneously analyzing all four base-specif ically 
terminated reaction products, the molecular weight values have been, In effect, interpolated. Comparison of the mass 
difference measured between fragments with the known masses of each chain-terminating nucleotide allows the as- 
signment of sequence to be carried out. In some instances, it may be desirable to mass modify, as discussed below, 
the chain-terminating nucleotides so as to expand the difference in molecular weight between each nucleotide. It will 
be apparent to those skilled in the art when mass-modification of the chain-terminating nucleotides is desirable and 
can depend, for Instance, on the resolving ability of the particular spectrometer employed. By way of example, it may 
be desirable to produce four chain -terminating nucleotides, ddTTP, ddCTP 1 , ddATP 2 and ddGTP 3 where ddCTP 1 , 
ddATP 2 and ddGTP 3 have each been mass-modified so as to have molecular weights resolvable from one another by 
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the particular spectrometer being used. 

[0029] The terms chaln-elongatlng nucleotides and chain -terminating nucleotides are well known in the art. For DNA, 
chain-elongating nucleotides include ^-deoxyribonucleotldes and chain-terminating nucleotides include 2\ 3-dideox- 
y ribonucleotides. For RNA, chain-elongating nucleotides include ribonucelotides and chain-terminating nucleotides 
5 include 3'deoxyribonucleotides. The term nucleotide is also well known in the art. For the purposes of this invention, 
nucleotides include nucleoside mono-, di-, and triphosphates. Mucleotides also include modified nucleotides such as 
phosphorothloate nucleotides. 

[0030] Since mass spectrometry is a serial method, in contrast to currently used slab gel electrophoresis which allows 
several samples to be processed in parallel, in another embodiment of this invention, a further improvement can be 

10 achieved by multiplex mass spectrometry DNA sequencing to allow simultaneous sequencing of more than one DNA 
or RNA fragment. As described in more detail below, the range of about 300 mass units between one nucleotide addition 
can be utilized by employing either mass-modified nucleic acid sequencing primers or chain-elongating and/or termi- 
nating nucleoside triphosphates so as to shift the molecular weight of the base-specifically terminated fragments of a 
particular DNA or RNA species being sequenced in a predetermined manner. For the first time, several sequencing 

is reactions can be mass spectrometrlcalfy analyzed in parallel. In yet another embodiment of this invention, multiplex 
mass spectrometric DNA sequencing can be performed by mass modifying the fragment families through specific 
oligonucleotides (tag probes) which hybridize to specific tag sequences within each of the fragment families, in another 
embodiment, the tag probe can be coval entry attached to the individual and specific tag sequence prior to mass spec- 
trometry. 

20 [0031] In one embodiment of the invention, the molecular weight values of at least two base-specifically terminated 
fragments are determined concurrently using mass spectrometry. The molecular weight values of preferably at least 
five and more preferably at least ten base-specifically terminated fragments are determined by mass spectrometry. 
Also included in the invention are determinations of the molecular weight values of at least 20 base-specificaily termi- 
nated fragments and at least 30 base-specifically terminated fragments. Further, the nested base-specifically termi- 

2S nated fragments in a specific set can be purified of all reactants and by-products but are not separated from one another. 
The entire set of nested base-specif icalfy terminated fragments is analyzed concurrently and the molecular weight 
values are determined. At least two base-specifically terminated fragments are analyzed concurrently by mass spec- 
trometry when the fragments are contained in the same sample. 

[0032] In general, the overall mass spectrometric DNA sequencing process will start with a library of small genomic 

30 fragments obtained after first randomly or specifically cutting the genomic DNA into large pieces which then, in several 
subdoning steps, are reduced in size and inserted into vectors like derivatives of M13 or pUC (e.g., M13mp18 or 
M13mp19) (see FIGURE 1). In a different approach, the fragments inserted in vectors, such as M13, are obtained via 
subdoning starting with a cDNA library. In yet another approach, the DNA fragments to be sequenced are generated 
by the polymerase chain reaction (e.g., Higuchi eta!., "A General Method of in vitro Preparation and Mutagenesis of 

35 DNA Fragments: Study of Protein and DNA Interactions," Nucleic Acids Res. , 16, 7351-67 (1 988)). As is known in the 
art, Sanger sequencing can start from one nucleic acid primer (UP) binding to the plus-strand or from another nucleic 
acid primer binding to the opposite minus-strand. Thus, either the complementary sequence of both strands of a given 
unknown DNA sequence can be obtained (providing for reduction of ambiguity in the sequence determination) or the 
length of the sequence information obtainable from one clone can be extended by generating sequence information 

40 from both ends of the unknown vector-inserted DNA fragment. 

[0033] The nucleic acid primer carries, preferentially at the S'-end, a linking functionality, L, which can include a 
spacer of sufficient length and which can interact with a suitable functionality, L\ on a solid support to form a reversible 
linkage such as a photocleavable bond. Since each of the four Sanger sequencing families starts with a nucleic acid 
primer (L-UP; FIGURE 1 ) this fragment family can be bound to the solid support by reacting with functional groups, L\ 

45 on the surface of a solid support and then intensively washed to remove all buffer salts, triphosphates, enzymes, 
reaction by-products, etc. Furthermore, for mass spectrometric analysis, it can be of Importance at this stage to ex- 
change the cation at the phosphate backbone of the DNA fragments in order to eliminate peak broadening due to a 
heterogeneity in the cations bound per nucleotide unit. Since the U-L' linkage is only of a temporary nature with the 
purpose to capture the nested Sanger DNA or RNA fragments to property condition them for mass spectrometric 

50 analysis, there are different chemistries which can serve this purpose, in addition to the examples given in which the 
nested fragments are coupled covalently to the solid support, washed, and cleaved off the support for mass spectro- 
metric analysis, the temporary linkage can be such that it is cleaved under the conditions of mass spectrometry. I.e., 
a photocleavable bond such as a charge transfer complex or a stable organic radical. Furthermore, the linkage can be 
formed with L' being a quaternary ammonium group (some examples are given in FIGURE 1 9). In this case, preferably, 

55 the surface of the solid support carries negative charges which repel the negatively charged nucleic acid backbone 
and thus facilitates desorption. Desorption will take place either by the heat created by the laser pulse and/or, depending 
on L,' by specific absorption of laser energy which is in resonance with the L* chromophore (see, e.g., examples given 
in FIGURE 1 9). The functionalities, L and L/ can also form a charge transfer complex and thereby form the temporary 
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L-U linkage. Various examples for appropriate functionalities with either acceptor or donator properties are depicted 
without limitation in FIGURES 20A and 20B. Since in many cases the "charge-transfer band" cart be determined by 
UV/vis spectrometry (see e.g. organic Charge Transfer Complexes by R. Foster, Academic Press, 1969), the laser 
energy can be tuned to the corresponding energy of the charge-transfer wavelength and, thus, a specific desorption 
£ off the solid support can be initiated. Those skilled in the art will recognize that several combinations can serve this 
purpose and that the donor functionality can be either on the solid support or coupled to the nested Sanger ONA/RNA 
fragments or vice versa. 

[0034] In yet another approach, the temporary linkage L-U can be generated by homolytically forming relatively stable 
radicals as exemplified in FIGURE 21 . In example 4 of FIGURE 21 , a combination of the approaches using charge- 
to transfer complexes and stable organic radicals is shown. Here, the nested Sanger DNA/RNA fragments are captured 
via the formation of a charge transfer complex. Under the influence of the laser pulse, desorption (as discussed above) 
as well as ionization will take place at the radical position. In the other examples of FIGURE 21 under the influence of 
the laser pulse, the L-L* linkage will be cleaved and the nested Sanger DNA/RNA fragments desorbed and subsequently 
ionized at the radical position formed. Those skilled in the art will recognize that other organic radicals can be selected 
'5 and that, in relation to the dissociation energies needed to homolytfcalry cleave the bond between them, a corresponding 
laser wavelength can be selected (see e.g. Reactive Molecules by C. Wentrup, John Wiley & Sons, 1984). In yet 
another approach, the nested Sanger DNA/RNA fragments are captured via Watson-Crick base pairing to a solid 
support-bound oligonucleotide complementary to either the sequence of the nucleic acid primer or the tag oligonucle- 
otide sequence (see FIGURE 22). The duplex formed will be cleaved under the influence of the laser pulse and des- 
2° orption can be initiated. The solid support-bound base sequence can be presented through natural ollgoribo- or oligo- 
deoxyribonucleotide as well as analogs (e.g. thio-modified phosphodiester or phosphotriester backbone) or employing 
oligonucleotide mlmetics such as PNA analogs (see e.g. Nielsen efa/., Science , 254, 1497 (1991)) which render the 
base sequence less susceptible to enzymatic degradation and hence increases overall stability of the solid support- 
bound capture base sequence. With, appropriate bonds, L-L 1 , a cleavage can be obtained directly with a laser tuned 
25 to the energy necessary for bond cleavage. Thus, the immobilized nested Sanger fragments can be directly ablated 
during mass spectrometric analysis. 

[0035] To increase mass spectrometric performance, it may be necessary to modify the phosphodiester backbone 
prior to MS analysis. This can be accomplished by, for example, using alpha-thlo modified nucleotides for chain elon- 
gation and termination. With alkylating agents such as akyllodldes, lodoacetamide, (Hodoethanol, 2,3-epoxy-1-propa- 

30 nol (see FIGURE 1 0), the monothio phosphodiester bonds of the nested Sanger fragments are transformed into phos- 
photriester bonds. Multiplexing by mass modification in this case is obtained by mass-modifying the nucleic acid primer 
(UP) or the nucleoside triphosphates at the sugar or the base moiety. To those skilled in the art, other modifications of 
the nested Sanger fragments can be envisioned. In one embodiment of the invention, the linking chemistry allows one 
to cleave off the so-purified nested DNA enzymaticalry, chemically or physically. By way of example, the L-L' chemistry 

33 can be of a type of disulfide bond (chemically cleavable, for example, by mercaptoethanol or dithioerythrof), a biotln/ 
streptavidin system, a heterobifunctional derivative of a trityl ether group (Kdster etal., "A Versatile Acid-Labile Linker 
for Modification of Synthetic Biomo leches," Tetrahedron Letters 31 , 7095 (1990)) which can be cleaved under mildly 
acidic conditions, a levulinyl group cleavable under almost neutral conditions with a hydrazinium/acetate buffer, an 
arginine-argtnine or lysine-lysine bond cleavable by an endopeptidase enzyme like trypsin or a pyrophosphate bond 

40 cleavable by a pyrophosphatase, a photocleavabte bond which can be, for example, physically cleaved and the like 
(see, e.g., FIGURE 23). Optionally, another cation exchange can be performed prior to mass spectrometric analysis. 
In the instance that an enzyme-cleavable bond is utilized to immobilize the nested fragments, the enzyme used to 
cleave the bond can serve as an internal mass standard during MS analysis. 

[0036] The purification process and/or ion exchange process can be carried out by a number of other methods instead 
43 of, or in conjunction with, immobilization on a solid support. For example, the base-specifically terminated products 
can be separated from the reactants by dialysis, filtration (Including ultrafiltration), and chromatography. Likewise, these 
techniques can be used to exchange the cation of the phosphate backbone with a counter-ion which reduces peak 
broadening. 

[0037] The base-specifically terminated fragment families can be generated by standard Sanger sequencing using 
so the Large Klenow fragment of E. cots DNA polymerase I, by Sequenase, Taq DNA polymerase and other DNA polymer- 
ases suitable for this purpose, thus generating nested DNA fragments for the mass spectrometric analysis. It is, how- 
ever, part of this invention that base-specifically terminated RNA transcripts of the DNA fragments to be sequenced 
can also be utilized for mass spectrometric sequence determination. In this case, various RNA polymerases such as 
the SP6 or the T7 RNA polymerase can be used on appropriate vectors containing, for example, the SP6 or the T7 
S3 promoters (e.g. Axelrod etal, "Transcription from Bacteriophage T7 and SP6 RNA Polymerase Promoters In the Pres- 
ence of 3'-Deoxyribonucteoside 5 ( -triphosphate Chain Terminators/' Biochemistry 24 , 5716-23 (1985)). In this case, 
the unknown DNA sequence fragments are inserted downstream from such promoters. Transcription can also be ini- 
tiated by a nucleic acid primer (Pitulle et at., "Initiator Oligonucleotides for the Combination of Chemical and Enzymatic 
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RNA Synthesis," Gene 112 , 1 01-105 (1992)) which carries, as one embodiment of this invention, appropriate linking 
functionalities, L, which allow the Immobilization of the nested RNA fragments, as outlined above, prior to mass spec- 
trometric analysis for purification and/or appropriate modification and/or conditioning. 

[0036] For this immobilization process of the DNA/RNA sequencing products for mass spectrometric analysis, various 
solid supports can be used, e.g., beads (silica gel, controlled pore glass, magnetic beads, Sephadex/Seph arose beads, 
cellulose beads, etc.), capillaries, glass fiber filters, glass surfaces, metal surfaces or plastic material. Examples of 
useful plastic materials include membranes in filter or microtiter plate formats, the latter allowing the automation of the 
purification process by employing microtiter plates which, as one embodiment of the invention, carry a permeable 
membrane in the bottom of the well fu notion alized with L\ Membranes can be based on polyethylene, polypropylene, 
potyamide, potyvinylidenedifluoride and the like. Examples of suitable metal surfaces include steel, gold, silver, alumi- 
num, and copper. After purification, cation exchange, and/or modification of the phosphodiester backbone of the L-L 1 
bound nested Sanger fragments, they can be cleaved off the solid support chemically, enzymatlcally or physically. Also, 
the L-L' bound fragments can be cleaved from the support when they are subjected to mass spectrometric analysis by 
using appropriately chosen L-L' linkages and corresponding laser energies/intensities as described above and in FIG- 
URES 19-23. 

[0039J The highly purified, four base-specifically terminated DNA or RNA fragment families are then analyzed with 
regard to their fragment lengths via determination of their respective molecular weights by MALDI or ES mass spec- 
trometry. 

[0040] For ES, the samples, dissolved in water or in a volatile buffer, are injected either continuously or discontinu- 
ously into an atmospheric pressure ionization Interface (API) and then mass analyzed by a quadrupole. With the aid 
of a computer program, the molecular weight peaks are searched for the known molecular weight of the nucleic acid 
primer (UP) and determined which of the four chain-terminating nucleotides has been added to the UP. This represents 
the first nucleotide of the unknown sequence. Then, the second, the third, the n m extension product can be identified 
in a similar manner and, by this, the nucleotide sequence Is assigned. The generation of multiple ion peaks which can 
be obtained using ES mass spectrometry can increase the accuracy of the mass determination. 
[0041] In MALDJ mass spectrometry, various mass analyzers can be used, e.g., magnetic sector/magnetic deflection 
instruments in single or triple quadrupole mode (MS/MS), Fourier transform and time-of-flight (TOF) configurations as 
is known In the art of mass spectrometry. FIGURES 2 A through 6 are given as an example of the data obtainable when 
sequencing a hypothetical DNA fragment of 50 nucleotides in length (SEQ ID NO:3) and having a molecular weight of 
15,344.02 daJtons. The molecular weights calculated for the doT (FIGURES 2A and 2B), ddA (FIGURES 3A and 3B), 
ddG (FIGURES 4A and 4B) and ddC (FIGURES 5A and SB) terminated products are given (corresponding to fragments 
of SEQ ID NO:3) and the idealized four MALDI-TOF mass spectra shown. All four spectra are superimposed, and from 
this, the DNA sequence can be generated. This is shown in the summarizing FIGURE 6, demonstrating how the mo- 
lecular weights are correlated with the DNA sequence. MALDI-TOF spectra have been generated for the ddT terminated 
products (FIGURES 1 6A-1 6M) corresponding to those shown in FIGURE 2 and these spectra have been superimposed 
(FIGURES 1 7A and 1 7B). The correlation of calculated molecular weights of the ddT fragments and their experimen- 
tally-verified weights are shown in Table 1 . Likewise, if all four chain-terminating reactions are combined and then 
analyzed by mass spectrometry, the molecular weight difference between two adjacent peaks can be used to determine 
the sequence. For the desorption/lonization process, numerous matrix/laser combinations can be used. 



TABLE I 



Correlation of calculated and experimentally verified molecular weights of the 13 DNA fragments of FIGURES 2 




and 16A-16M. 




Fragment (n-mer) 


calculated mass 


experimental mass 


difference 


7-mer 


21 04.45 


2119.9 


+15.4 


10-mer 


3011.04 


3026.1 


+15.1 


11-mer 


3315.24 


3330.1 


+14.9 


1 9-mer 


5771 .82 


5788.0 


+16.2 


20-mer 


6076.02 


6093.8 


+17.8 


24-mer 


7311.82 


7374.9 


+63.1 


26-mer 


7945.22 


7960.9 


+15.7 


33-mer 


10112.63 


10125.3 


+12.7 


37-mer 


11348.43 


11361.4 


+13.0 


38-mer 


11652.62 


. 11670.2 


+17.6 


42-mer 


12872.42 


12888.3 


+ 15.9 
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TABLE I (continued) 



Correlation of calculated and experimentally verified molecular weights of the 13 DNA fragments of FIGURES 2 




and16A-16M. 




Fragment (n-mer) 


calculated mass 


experimental mass 


difference 


46-mer 


14108.22 


14125.0 


+16.8 


50-mer 


15344.02 


15362.6 


+18.6 



10 [0042] In order to increase throughput to a level necessary for high volume genomic and cDN A sequencing projects, 
a further embodiment of the present invention is to utilize multiplex mass spectrometry to simultaneously determine 
more than one sequence. This can be achieved by several, albeit different, methodologies, the basic principle being 
the mass modification of the nucleic acid primer (UP), the chain-elongating and/or terminating nucleoside triphosphates, 
or by using mass-differentiated tag probes hybridlzabie to specific tag sequences. The term "nucleic acid primer as 

15 used herein encompasses primers for both DNA and RNA Sanger sequencing. 

[0043] By way of example, FIGURE 7A presents a general formula of the nucleic acid primer (UP) and the tag probes 
(TP). The mass modifying moiety can be attached, for instance, to either the 5'-end of the oligonucleotide (M 1 ), to the 
nucleobase (or bases) (M 2 , M 7 ), to the phosphate backbone (M 3 ), and to the 2-position of the nucleoside (nucleosides) 
(M 4 , M 6 ) or/and to the terminal 3-position (M 5 ). Primer length can vary between 1 and 50 nucleotides In length. For 

20 the priming of DNA Sanger sequencing, the primer is preferentially in the range of about 1 5 to 30 nucleotides in length. 
For artificially priming the transcription in a RNA polymerase-mediated Sanger sequencing reaction, the length of the 
primer is preferentially in the range of about 2 to 6 nucleotides. If a tag probe (TP) is to hybridize to the integrated tag 
sequence of a family chain-terminated fragments, its preferential length is about 20 nucleotides. 
[0044] The table in FIGURE 7B depicts some examples of mass-modified primer/tag probe configurations for DNA, 

25 as well as RNA, Sanger sequencing. This list is, however, not meant to be limiting, since numerous other combinations 
of mass-modifying functions and positions within the oligonucleotide molecule are possible and are deemed part of 
the invention. The mass-modifying functionality can be, for example, a halogen, an azido, or of the type, XR, wherein 
X is a linking group and R is a mass-modifying functionality. The mass-modifying functionality can thus be used to 
introduce defined mass increments into the oligonucleotide molecule. 

30 [0045] In another embodiment, the nucleotides used for chain-elongation and/or termination are mass-modified. 
Examples of such modified nucleotides are shown in FIGURE 8A and 8B. Here the mass-modifying moiety, M, can be 
attached either to the nucleobase, M 2 (in case of the c 7 -deazanucleosldes also to C-7, M 7 ), to the triphosphate group 
at the alpha phosphate, M 3 , or to the 2'-position of the sugar ring of the nucleoside triphosphate, M 4 and M 6 . Further- 
more, the mass-modifying functionality can be added so as to affect chain termination, such as by attaching it to the 

35 3'-position of the sugar ring in the nucleoside triphosphate, M 5 . The list In FIGURE 8B represents examples of possible 
configurations for generating chain-terminating nucleoside triphosphates for RNA or DNA Sanger sequencing. For 
those skilled in the art, however, it is clear that many other combinations can serve the purpose of the Invention equally 
well. In the same way, those skilled in the art will recognize that chain-elongating nucleoside triphosphates can also 
be mass-modified in a similar fashion with numerous variations and combinations in functionality and attachment po- 

40 sitions. 

[0046] Without limiting the scope of the invention, FIGURE 9 gives a more detailed description of particular examples 
of how the mass-modification, M, can be introduced for X in XR as well as using olfgo-/po!y ethylene glycol derivatives 
for R. The mass-modifying increment In this case is 44, i.e. five different mass-modified species can be generated by 
just changing m from 0 to 4 thus adding mass units of 45 (m=0), 89 (m=1), 133 (m=2), 177 (m=3) and 221 (m=4) to 

43 the nucleic acid primer (UP), the tag probe (TP) or the nucleoside triphosphates respectively. The oligo/polyethylene 
glycols can also be monoalkylated by a lower alkyl such as methyl, ethyl, propyl, isopropyl, t-butyl and the like. A 
selection of linking functionalities, X, are also illustrated. Other chemistries can be used in the mass-modified com- 
pounds, as for example, those described recently in Oligonucleotides and Analogues, A Practical Approach , F. Eckstein, 
editor, IRL Press, Oxford, 1991. ~ 

50 [0047] In yet another embodiment, various mass-modifying functionalities, R, other than oligo/polyethylene glycols, 
can be selected and attached via appropriate linking chemistries, X. Without any limitation, some examples are given 
in FIGURE 10. A simple mass-modification can be achieved by substituting H for halogens like F, CI, Br and/or I, or 
pseudohalogens such as SCN, NCS, or by using different alkyl, aryl or aralkyl moieties such as methyl, ethyl, propyl, 
isopropyl, t-butyl, hexyl, phenyl, substituted phenyl, benzyl, or functional groups such as CH 2 F, CHF 2 , CF 3 , SKCHjfe, 

55 Si(CH 3 )^(C 2 H 5 ) 1 SKCHaXCgHgJg, Si^H^. Yet another mass-modification can be obtained by attaching homo- or 
heteropeptides through X to the UP, TP or nucleoside triphosphates. One example useful In generating mass-modified 
species with a mass increment of 57 is the attachment of oligoglycines, e.g., mass-modifications of 74 (r=1, m=0), 
131 (r=1, m=2), 188 (r=1, m=3), 245 (r=1, m=4) are achieved. Simple oligoamides also can be used, e.g., mass- 
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modifications of 74 (r=1, m=0), 88 (r=2, m=0), 102 (r=3, m=0), 116 (r^4, m=0), etc. are obtainable. For those skilled in 
the art, it will be obvious that there are numerous possibilities in addition to those given in FIGURE 10 and the above 
mentioned reference (Oligonucleotides and Analogues , F. Eckstein, 1991), for introducing, in a predetermined manner, 
many different mass-modifying functionalities to UP, TP and nucleoside triphosphates which are acceptable for DNA 
and RNA Sanger sequencing. 

[0048] As used herein, the superscript 0-i designates I + 1 mass differentiated nucleotides, primers or tags. In some 
Instances, the superscript 0 (e.g., NTP°, UP 0 ) can designate an unmodified species of a particular reactant, and the 
superscript i (e.g., NTP 1 , NTP 1 , NTP 2 , etc.) can designate the i-th mass-modified species of that reactant. If, for example, 
more than one species of nucleic acids (e.g., DNA clones) are to be concurrently sequenced by multiplex DNA se- 
quencing, then i + 1 different mass-modified nucleic acid primers (UP 0 , UP 1 •...UP') can be used to distinguish each set 
of base-specifically terminated fragments, wherein each species of mass-modified UP 1 can be distinguished by mass 
spectrometry from the rest. 

[0049] As illustrative embodiments of this invention, three different basic processes for multiplex mass spectrometric 
DNA sequencing employing the described mass-modified reagents are described below: 

A) Multiplexing by the use of mass-modified nucleic acid primers (UP) for Sanger DNA or RNA sequencing (see 
for example RGURE 11); 

B) Multiplexing by the use of mass-modified nucleoside triphosphates as chain elongators and/or chain terminators 
for Sanger DNA or RNA sequencing (see for example FIGURE 12); and 

C) Multiplexing by the use of tag probes which specifically hybridize to tag sequences which are integrated into 
part of the four Sanger DNA/RNA base-specif icafly terminated fragment families. Mass modification here can be 
achieved as described for FIGURES 7A, 7B, 9 and 10, or alternately, by designing different oligonucleotide se- 
quences having the same or different length with unmodified nucleotides which, in a predetermined way, generate 
appropriately differentiated molecular weights (see for example FIGURE 13). 

[0050] The process of multiplexing by mass-modified nucleic acid primers (UP) is illustrated by way of example In 
FIGURE 11 for mass analyzing four different DNA clones simultaneously. The first reaction mixture is obtained by 
standard Sanger DNA sequencing having unknown DNA fragment 1 (clone 1) integrated in an appropriate vector (e. 
g., M13mp1 B). employing an unmodified nucleic acid primer UP 0 , and a standard mixture of the four unmodified 
deoxynucleoside triphosphates, dNTP 0 , and with 1/1 0th of one of the four dideoxy nucleoside triphosphates, ddNTP 0 
A second reaction mixture for DNA fragment 2 (clone 2) is obtained by employing a mass-modified nucleic acid primer 
UP 1 and, as before, the four unmodified nucleoside triphosphates, dNTP 0 , containing in each separate Sanger reaction 
1/1 0th of the chain-terminating unmodified dideoxynucieoside triphosphates ddNTP 0 . In the other two experiments, 
the four Sanger reactions have the following compositions: DNA fragment 3 (clone 3), UP 2 , dNTP 0 , ddNTP 0 and DNA 
fragment 4 (clone 4), UP 3 , dNTP 0 , ddNTP 0 . For mass spectrometric DNA sequencing, all base-specifically terminated 
reactions of the four clones are pooled and mass analyzed. The various mass peaks belonging to the four dideoxy- 
terminated (e.g.. dcfT-terminated) fragment families are assigned to speciffcaJly elongated and doT-terminated frag- 
ments by searching (such as by a computer program) for the known molecular ion peaks of UP 0 , UP 1 , UP 2 and UP 3 
extended by either one of the four dideoxynucieoside triphosphates, UP°-ddN° UP^ddN 0 , UP 2 -ddN° and UP 3 -ddN°. 
In this way, the first nucleotides of the four unknown DNA sequences of clone 1 to 4 are determined. The process is 
repeated, having memorized the molecular masses of the four specific first extension products, until the four sequences 
are assigned. Unambiguous mass/sequence assignments are possible even in the worst case scenario In which the 
four mass-modified nucleic acid primers are extended by the same dideoxynucieoside triphosphate, the extension 
products then being, for example, UP°-ddT, UP^ddT, UP^doT and UF^-ddT, which differ by the known mass Increment 
differentiating the four nucleic acid primers. In another embodiment of this invention, an analogous technique Is em- 
ployed using different vectors containing, for example, the SP6 and/or T7 promoter sequences, and performing tran- 
scription with the nucleic acid primers UP 0 , UP 1 , UP 2 and UP 3 and either an RNA polymerase (e.g., SP6 or T7 RNA 
polymerase) with chain-elongating and terminating unmodified nucleoside triphosphates NTP 0 and S'-dNTP 0 Here, 
the DNA sequence Is being determined by Sanger RNA sequencing. 

[0051 ] FIGURE 12 illustrates the process of multiplexing by mass -modified chain-elongating or/and terminating nu- 
cleoside triphosphates in which three different DNA fragments (3 clones) are mass analyzed simultaneously- The first 
DNA Sanger sequencing reaction (DNA fragment 1, clone 1) is the standard mixture employing unmodified nucleic 
acid primer UP 0 , dNTP 0 and in each of the four reactions one of the four ddNTP 0 . The second (DNA fragment 2, clone 
2) and the third (DNA fragment 3, done 3) have the following contents: UP 0 , dNTP 0 , ddNTP 1 and UP 0 , dNTP 0 , ddNTP 2 
respectively. In a variation of this process, an amplification of the mass increment in mass-modifying the extended 
DNA fragments can be achieved by either using an equally mass-modified deoxynucleoside triphosphate (i.e., dNTP 1 , 
dNTP 2 ) for chain elongation alone or in conjunction with the homologous equally mass-modified dideoxynucieoside 
triphosphate. For the three clones depicted above, the contents of the reaction mixtures can be as follows: either UP 0 / 
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dNTpO/ddNTP 0 , UP°/dNTPVddNTP° and UP°/dNTP 2 /ddNTP° or UP°/dNTP°/ddNTP 0 , UP°/dNTPVddNTPi and UP 0 / 
dNTP 2 /ddNTP 2 . As described above, DNA sequencing can be performed by Sanger RNA sequencing employing un- 
modified nucleic acid primers, UP 0 , and an appropriate mixture of chain-elongating and terminating nucleoside triphos- 
phates. The mass-modification can be again either in the chain-terminating nucleoside triphosphate alone or in con- 
junction with mass-modified chain-elongating nucleoside triphosphates. Multiplexing is achieved by pooling the three 
base-specifically terminated sequencing reactions (e.g., the ddTTP terminated products) and simultaneously analyzing 
the pooled products by mass spectrometry. Again, the first extension products of the known nucleic acid primer se- 
quence are assigned, e.g., via a computer program. Mass/sequence assignments are possible even in the worst case 
in which the nucleic acid primer is extended/terminated by the same nucleotide, e.g., ddT, in all three clones. The 
following configurations thus obtained can be well differentiated by their different mass-modifications: UP°-ddT°, UP°- 
dcm.UPO-ddT 2 . 

[0052] In yet another embodiment of this invention, DNA sequencing by multiplex mass spectrometry can be achieved 
by cloning the DNA fragments to be sequenced in "plex-vectors" containing vector specific "tag sequences" as de- 
scribed (Koster etat, "Oligonucleotide Synthesis and Multiplex DNA Sequencing Using Chemiluminescent Detection," 
Nucleic Acids Res. Symposium Ser. No. 24, 318-321 (1991)); then pooling clones from different plex-vectors for DNA 
preparation and the four separate Sanger sequencing reactions using standard dNTP°/ddNTP° and nucleic acid primer 
UP 0 ; purifying the four multiplex fragment families via linking to a solid support through the linking group, L, at the 5- 
end of UP; washing out all by-products, and cleaving the purified muttlplex DNA fragments off the support or using the 
L-L' bound nested Sanger fragments as such for mass spectrometric analysis as described above; performing demul- 
tiplexing by one-by-one hybridization of specific "tag probes"; and subsequently analyzing by mass spectrometry (see, 
for example, FIGURE 13). As a reference point, the four base-specifically terminated multiplex DNA fragment families 
are run by the mass spectrometer and all ddT 0 -, ddA 0 -, ddC°- and ddG°-terminated molecular ion peaks are respectively 
detected and memorized. Assignment of, for example, ddT°-terminated DNA fragments to a specific fragment family 
is accomplished by another mass spectrometric analysis after hybridization of the specific tag probe (TP) to the cor- 
responding tag sequence contained in the sequence of this specific fragment family. Only those molecular ion peaks 
which are capable of hybridizing to the specific tag probe are shifted to a higher molecular mass by the same known 
mass increment (e.g. of the lag probe). These shifted ion peaks, by virtue of all hybridizing to a specific tag probe, 
belong to the same fragment family. For a given fragment family, this Is repeated for the remaining chain terminated 
fragment families with the same tag probe to assign the complete DNA sequence. This process is repeated i-1 times 
corresponding to i clones multiplexed (the i-th done is identified by default). 

[0053] The differentiation of the tag probes for the different multiplexed clones can be obtained just by the DNA 
sequence and Its ability to Watson-Crick base pair to the tag sequence. It is well known in the art how to calculate 
stringency conditions to provide for specific hybridization of a given tag probe with a given tag sequence (see, for 
example, Molecular Cloning: A laboratory manual 2ed, ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor 
Laboratory Press: NY, 1989, Chapter 11). Furthermore, differentiation can be obtained by designing the tag sequence 
for each plex -vector to have a sufficient mass difference so as to be unique just by changing the length or base com- 
position or by mass-modifications according to FIGURES 7A, 7B, 9 and 10. In order to keep the duplex between the 
tag sequence and the tag probe intact during mass spectrometric analysis, it is another embodiment of the invention 
to provide for a covalent attachment mediated by, for example, photo reactive groups such as psoralen and ellipticine 
and by other methods known to those skilled in the art (see, for example, Helene etal., Nature 344 , 358 (1990) and 
Thuong et a!. "Oligonucleotides Attached to Intercalates, Photoreactive and Cleavage Agents* in F. Eckstein, Oligo- 
nucleotides and Analogues: A Practical Approach , IRL Press, Oxford 1991 , 283-306). 

[0054] The DNA sequence is unraveled again by searching for the lowest molecular weight molecular ion peak 
corresponding to the known UP°-tag sequence/tag probe molecular weight plus the first extension product, e.g., ddl°, 
then the second, the third, etc. 

[0055] In a combination of the latter approach with the previously described multiplexing processes, a further Increase 
in multiplexing can be achieved by using, in addition to the tag probe/tag sequence interaction, mass-modified nucleic 
acid primers (FIGURES 7A and 7B) and/or mass-modified deoxy nucleoside, dNTP 0-1 , and/or dideoxynucleoside tri- 
phosphates, ddNTP 0 " 1 . Those skilled in the art will realize that the tag sequence/tag probe multiplexing approach Is 
not limited to Sanger DNA sequencing generating nested DNA fragments with DNA polymerases. The DNA sequence, 
can also be determined by transcribing the unknown DNA sequence from appropriate promoter-containing vectors 
(see above) with various RNA polymerases and mixtures of NTPG-VS'-dNTP 0 " 4 , thus generating nested RNA fragments. 
[0056] In yet another embodiment of this invention, the mass-modifying functionality can be Introduced by a two or 
multiple step process. In this case, the nucleic acid primer, the chain-elongating or terminating nucleoside triphosphates 
and/or the tag probes are, In a first step, modified by a precursor functionality such as azWo, -N 3 , or modified with a 
functional group in which the R In XR is H (FIGURES 7A, 7B, 9) thus providing temporary functions, e.g., but not limited 
to -OH, -NH 2 , -NHR, -SH, -NCS, -OCO(CH 2 ) f COOH (r= 1-20), -NHCO(CH 2 ) r COOH(r = 1-20), -OSO 2 0H, -OCOfCHj,)^ 
(r= 1 -20), -OP(0-Alkyl)N(Alkyl)2. These less bulky functionalities result in better substrate properties for the enzymatic 
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DNA or RNA synthesis reactions of the DNA sequencing process. The appropriate mass-modifying functionality is then 
introduced after the generation of the nested base-specifically terminated DNA or RNA fragments prior to mass spec- 
trometry. Several examples of compounds which can serve as mass-modifying functionalities are depicted In FIGURES 

9 and 10 without limiting the scope of this invention. 

(0057] Another aspect of this Invention concerns kits for sequencing nucleic acids by mass spectrometry which in- 
clude combinations of the above-described sequencing reactants. For instance, in one embodiment, the kit comprises 
reactants for multiplex mass spectrometric sequencing of several different species of nucleic acid. The kit can include 
a solid support having a linking functionality (L 1 ) for immobilization of the base-specifically terminated products; at least 
one nucleic acid primer having a linking group (L) for reversibry and temporarily linking the primer and solid support 
through, for example, a photocleavable bond; a set of chain-elongating nucleotides (e.g., dATP, dCTP, dGTP and dTTP, 
or ATP, CTP, GTP and UTP); a set of chain-terminating nucleotides (such as 2' f 3'-dideoxy nucleotides for DNA synthesis 
or 3'-deoxy nucleotides for RNA synthesis); and an appropriate polymerase for synthesizing complementary nucle- 
otides. Primers anoVor terminating nucleotides can be mass-modified so that the base-specif icaily terminated fragments 
generated from one of the species of nucleic acids to be sequenced can be distinguished by mass spectrometry from 
ail of the others. Alternative to the use of mass-modified synthesis reactants, a set of tag probes (as described above) 
can be included in the kit. The kit can also include appropriate buffers as well as instructions for performing multiplex 
mass spectrometry to concurrently sequence multiple species of nucleic acids. 

[0058] In another embodiment, a nucleic acid sequencing kit can comprise a solid support as described above, a 
primer for initiating synthesis of complementary nucleic acid fragments, a set of chain-elongating nucleotides and an 
appropriate polymerase. The mass-modified chain-terminating nucleotides are selected so that the addition of one of 
the chain terminators to a growing complementary nucleic acid can be distinguished by mass spectrometry. 

EXAMPLE 1 

Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometric 
analysis via disulfide bonds. 

[0059] As a solid support, Sequelon membranes (Millipore Corp., Bedford, MA) with phenyl isolhlocyanate groups 
are used as a starting material. The membrane disks, with a diameter of 8 mm, are wetted with a solution of N-meth- 
ylmorpholine/water/2-propanoI (NMM solution) (2/49/49 v/v/v), the excess liquid removed with filter paper and placed 
on a piece of plastic film or aluminum foil located on a heating block set to 55°C. A solution of 1 mM 2-mercaptoethyt- 
amine (cysteamine) or 2, 2'-dithio-bis(ethylamlne) (cystamine) or S-(2-thiopyridyl)-2-thio-ethylamine (10 uf, 10 nmol) 
in NMM is added per disk and heated at 55°C. After 15 min, 10 ul of NMM solution are added per disk and heated for 
another 5 min. Excess of Isothiocyanate groups may be removed by treatment with 10 ut of a 10 mM solution of glycine 
in NMM solution. For cystamine, the disks are treated with 10 ul of a solution of 1M aqueous dithiothreltol (DTTy 
2-propanol (1:1 v/v) for 1 5 min at room temperature. Then, the disks are thoroughly washed in a filtration manifold with 
5 aliquots of 1 ml each of the NMM solution, then with 5 allquots of 1 ml acetonitrile/water (1/1 v/v) and subsequently 
dried. If not used immediately the disks are stored with free thiol groups in a solution of 1M aqueous dithiothreitol/ 
2-propanol (1:1 v/v) and, before use, DTT is removed by three washings with 1 ml each of the NMM solution. The 
primer oligonucleotides with 5'-SH functionality can be prepared by various methods (e.g., B.C.F Chu era/., Nucleic 
Adds Res. 14, 5591-5603 (1986), Sproat et at.. Nucleic Acids Res. 15, 4837-48 (1987) and Oligonucleotides and 
Analogues: A Practical Approach (F. Eckstein, editor), IRL Press Oxford, 1991). Sequencing reactions according to 
the Sanger protocol are performed in a standard way (e.g., H. Swerdlow etaJ., Nucleic Adds Res. 1 8, 1415-19 (1990)). 
In the presence of about 7-10 mM DTT the free 5* -thiol primer can be used; in other cases, the SH functionality can 
be protected, e.g., by a trityl group during the Sanger sequencing reactions and removed prior to anchoring to the 
support in the following way. The four sequencing reactions (150 ul each in an Eppendorf tube) are terminated by a 

10 min incubation at 70 Q C to denature the DNA polymerase (such as Kienow fragment, Sequenase) and the reaction 
mixtures are ethanol precipitated. The supematants are removed and the pellets vortexed with 25 ul of an 1 M aqueous 
silver nitrate solution, and after one hour at room temperature, 50 ul of an 1M aqueous solution of DTT is added and 
mixed by vortexlng. After 1 5 min, the mixtures are centrifuged and the pellets are washed twice with 1 00 ul ethylacetate 
by vortexing and centrifugation to remove excess DTT. The primer extension products with free 5'-thlol group are now 
coupled to the thiolated membrane supports under mild oxidizing conditions. In general, it is sufficient to add the 5'- 
thlolated primer extension products dissolved in 10 ul 10 mM de-aerated triethylammonium acetate buffer (TEAA) pH 
7.2 to the thiolated membrane supports. Coupling is achieved by drying the samples onto the membrane disks with a 
cold fan. This process can be repeated by wetting the membrane with 10 ul of 10 mM TEAA buffer pH 7.2 and drying 
as before. When using the 2-thiopyridyl derivatized compounds, anchoring can be monitored by the release of pyridine- 
2-thione spectrophotometrically at 343 nm. 

[0060] In another variation of this approach, the oligonucleotide primer is functionalized with an amino group at the 
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5'-end which is introduced by standard procedures during automated DNA synthesis. After primer extension, during 
the Sanger sequencing process, the primary amino group Is reacted with 3-(2-pyridyldithio) propionic acid N-hydrox- 
ysuccinimide ester (SPDP) and subsequently coupled to thethioiated supports and monitored by the release of pyridyl- 
2-th ion e as described above. After den atu ration of DNA polymerase and ethanol precipitation of the sequencing prod- 
ucts, the supernatants are removed and the pellets dissolved in 10 ul 10 mM TEAA buffer pH 7.2 and 10 ul of a 2 mM 
solution of SPDP in 1 0 mM TEAA are added. The reaction mixture Is vortexed and incubated for 30 min at 25° C. Excess 
SPDP is then removed by three extractions (vortexing, centrifugation) with 50 ul each of ethanol and the resulting 
pellets are dissolved in 10 ul 10 mM TEAA buffer pH 7.2 and coupled to the thiofated supports (see above). 
[0061] The primer-extension products are purified by washing the membrane disks three times each with 100 ul 
NMM solution and three times with 100 ul each of 10 mM TEAA buffer pH 7.2. The purified primer-extension products 
are released by three successive treatments with 10 ul of 10 mM 2-mercaptoethanoI in 10 mM TEAA buffer pH 7.2, 
lyophilized and analyzed by either ES or MALDI mass spectrometry. 

[0062] This procedure can abo be used for the mass-modified nucleic acid primers UP 0-1 in an analogous and ap- 
propriate way, taking into account the chemical properties of the mass-modifying functionalities. 

EXAMPLE 2 

Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometrte 
analysis via the levulinyl group 

[0063] 5-Aminolevulinlc acid is protected at the primary amino group with the Fmoc group using 9-fluorenylmethyl 
N-succinimidyl carbonate and is then transformed into the N-hydroxysuccinimide ester (NHS ester) using N-hydroxy- 
succinimide and dteyclohexyl carbodilmide under standard conditions. For the Sanger sequencing reactions, nucleic 
acid primers, UP 0 * 1 , are used which are functionalized with a primary amino group at the 5'-end introduced by standard 
procedures during automated DNA synthesis with aminollnker phosphoamidites as the final synthetic step. Sanger 
sequencing is performed under standard conditions (see above). The four reaction mixtures (150 ul each in an Eppen- 
dorf tube) are heated to 70*0 for 10 min to inactivate the DNA polymerase, ethanol precipitated, centrifuged and 
resuspended in 10 ul of 10 mM TEAA buffer pH 7.2. 10 ul of a 2 mM solution of the Fmoc-5-aminolevulinyl-NHS ester 
in 1 0 mM TEAA buffer is added, vortexed and incubated at 25°C for 30 min. The excess of the reagent is removed by 
ethanol precipitation and centrifugation. The Fmoc group is cleaved off by resuspending the pellets in 1 0 ul of a solution 
of 20% piperidine In N.N-dimethylfoirnamide/water (1:1 v/v). After 15 min at 25°C, piperidine is thoroughly removed 
by three precipitation s/centrif ugations with 1 00 ul each of ethanol, the pellets are resuspended in 1 0 ul of a solution of 
N-methyfmorphollne, 2-propanol and water (2/10/88 wv/v) and are coupled to the solid support carrying an Isothlocy- 
anate group. In the case of the DITC-Sequelon membrane (Millipore Corp., Bedford, MA), the membranes are prepared 
as described in EXAMPLE 1 and coupling is achieved on a heating block at 55°C as described above. RNA extension 
products are immobilized in an analogous way. The procedure can be applied to other solid supports with isothiocyanate 
groups in a similar manner. 

[0064] The immobilized primer-extension products are extensively washed three times with 100 ul each of NMM 
solution and three times with 100 ul 10 mM TEAA buffer pH 7.2. The purified primer-extension products are released 
by three successive treatments with 10 ul of 100 mM hydrazlnium acetate buffer pH 6.6, fyophilized and analyzed by 
either ES or MALDI mass spectrometry. 

EXAMPLE 3 

Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectro metric 
analysis via a trypsin sensitive linkage 

[0065] Sequelon DITC membrane disks of 8 mm diameter (Millipore Corp., Bedford, MA) are wetted with 10 ul of 
NMM solution (N-methylmorpholine/propanaol-2/water, 2/49/49 v/v/v) and a linker arm Introduced by reaction with 10 
ul of a 10 mM solution of 1 ,6-dlaminohexane in NMM. The excess diamine is removed by three washing steps with 
100 ul of NMM solution. Using standard peptide synthesis protocols, two L-lyafne residues are attached by two suc- 
cessive condensations with N-Fmoc-N-tBoc-L- lysine pentafluorophenylester, the terminal Fmoc group is removed with 
piperidine In NMM and the free a-amino group coupled to 1,4-phenylene dUsothiocyanate (DITC). Excess DrTC Is 
removed by three washing steps with 100 ul 2-propanol and the N-tBoc groups removed with trifluoroacetic acid ac- 
cording to standard peptide synthesis procedures. The nucleic acid primer-extension products are prepared from oli- 
gonucleotides which carry a primary amino group at the 5'-terminus. The four Sanger DNA sequencing reaction mix- 
tures (150 ul each in Eppendorf tubes) are heated for 10 min at 70°C to inactivate the DNA polymerase, ethanoJ 
precipitated, and the pellets resuspended in 10 ul of a solution of N-methylmorpholine, 2-propanoJ and water (2/10/88 
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v/v/v). This solution Is transferred to the Lys-Lys-DITC membrane disks and coupled on a heating block set at 55°C. 
After drying, 10 ut of NMM solution is added and the drying process repeated. 

[0066] The immobilized primer-extension products are extensively washed three times with 100 ul each of NMM 
solution and three times with 100 u I each of 10 mM TEAA buffer pH 7.2. For mass spectrometry analysis, the bond 
between the primer-extension products and the solid support is cleaved by treatment with trypsin under standard con- 
ditions and the released products analyzed by either ES or MALDI mass spectrometry with trypsin serving as an internal 
mass standard. 

EXAMPLE 4 

Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometrJc 
analysis via pyrophosphate linkage 

[0067] The DITC Sequelon membrane (disks of 8 mm diameter) are prepared as described in EXAMPLE 3 and 10 
ut of a 10 mM solution of 3-aminopyridine adenine dinucleotide (APAD) (Sigma) in NMM solution added. The excess 
APAD is removed by a 10 ul wash of NMM solution and the disks are treated with 1 0 ul of 1 0 mM sodium periodate in 
NMM solution (15 min, 25°C). Excess periodate is removed and the primer-extension products of the four Sanger DNA 
sequencing reactions (150 ul each in Eppendorf tubes) employing nucleic acid primers with a primary amino group at 
the 5* -end are ethanol precipitated, dissolved in 10 ul of a solution of N-methylmorpholine/2-propanol/water (2/10/88 
v/wv) and coupled to the 2' 3'-dialdehydo groups of the immobilized NAD analog.' 

[0068] The primer-extension products are extensively washed with the NMM solution (3 times with 100 ul each) and 
10 mM TEAA buffer pH 7.2 (3 times with 100 ul each) and the purified primer-extension products are released by 
treatment with either NADase or pyrophosphatase in 10 mM TEAA buffer at pH 7.2 at 37° C for 15 min, lyophiiized and 
analyzed by either ES or MALDI mass spectrometry, the enzymes serving as internal mass standards. 

EXAMPLE S 

Synthesis of nucleic acid primers mass-modified by glycine residues at the 5'- position of the sugar moiety of 
the terminal nucleoside 

[0069] Oligonucleotides are synthesized by standard automated DNA synthesis using fi-cyanoethylphosphoamidites 
(H. Koster et a/., Nucleic Acids Res. 12, 4539 (1984)) and a 5'-amino group is introduced at the end of solid phase 
DNA synthesis (e.g. Agrawal et a/., Nucleic Acids Res. 14, 6227-45 (1986) or Sproat et a/., Nucleic Acids Res. 15, 
6181 -96 (1987)). The total amount of an oligonucleotide synthesis, starting with 0.25 umol CPG-bound nucleoside, is 
deprotected with concentrated aqueous ammonia, purified via OligoPAK™ Cartridges (Milfipore Corp., Bedford, MA) 
and lyophiiized. This material with a SMerminal amino group is dissolved in 100 ul absolute N.N-dimethylformamfde 
(DMF) and condensed with 1 0 ujnole N-Fmoc-glycine pentafluorophenyl ester for 60 min at 25°C. After ethanol pre- 
cipitation and centrifugation, the Fmoc group is cleaved off by a 10 min treatment with 100 ul of a solution of 20% 
piperidine in N,N-dimethylform amide. Excess piperidine, DMF and the cleavage product from the Fmoc group are 
removed by ethanol precipitation and the precipitate lyophiiized from 10 mM TEAA buffer pH 7.2. This material is now 
either used as primer for the Sanger DNA sequencing reactions or one or more glycine residues (or other suitable 
protected amino acid active esters) are added to create a series of mass-modified primer oligonucleotides suitable for 
Sanger DNA or RNA sequencing. Immobilization of these mass-modified nucleic acid primers UP 0 "' after primer-ex- 
tension during the sequencing process can be achieved as described, e.g., in EXAMPLES 1 to 4. 

EXAMPLE 6 

Synthesis of nucleic acid primers mass-modified at C-5 of the heterocyclic base of a pyrimldlne nucleoside 
with glycine residues 

[0070] Starting material was 5-(3-ammopropynyM )-3* 5'-di-p-tolyldeoxyuridine prepared and 3* 5'-de-0-acylated ac- 
cording to literature procedures (Haralambidis et a/., Nucleic Acids Res. 15, 4857-76 (1987)). 0.281 g (1.0 mmole) 
5-(3-aminopropynyl-1)-2'-deoxyuridine were reacted with 0.927 g (2.0 mmole) N-Fmoc-glyclne pentafluorophenylester 
in 5 ml absolute N,N-dimethyrformamide in the presence of 0.129 g (1 mmole; 174 ul) N,N-diisopropylethylamIne for 
60 min at room temperature. Solvents were removed by rotary evaporation and the product was purified by silica gel 
chromatography (Kieselgel 60, Merck; column: 2.5x 50 cm, elution with chloroform/methanol mixtures). Yield was 0.44 
g (0.78 mmole, 78 %). In order to add another glycine residue, the Fmoc group is removed with a 20 min treatment 
with 20% solution of piperidine in DMF, evaporated in vacuo and the remaining solid material extracted three times 
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with 20 ml ethylacetate. After having removed the remaining ethylacetate, N-Fmoc-glycine pentafluorophenylester is 
coupled as described above, 5- (S-fN-FmcKj-glycylJ-amidopmpynyl-t^-deoxy uridine is transformed into the 5*-0 
dlmethoxytritytated nucleoside-3 -0-p-cyanoethyl-N,N-diisopropy!phosphoamidite and incorporated into automated ol- 
igonucleotide synthesis by standard procedures (H. Koster et a/., Nucleic Acids Res, 12, 2261 (1984)). This glycine 
modified thymidine analogue building block for chemical DNA synthesis can be used to substitute one or more of the 
thymidine/uridine nucleotides in the nucleic acid primer sequence. The Fmoc group is removed at the end of the solid 
phase synthesis with a 20 min treatment with a 20 % solution of piperidine in DM Fat room temperature. DMFis removed 
by a washing step with acetonitrile and the oligonucleotide deprotected and purified in the standard way. 

EXAMPLE 7 

Synthesis of a nucJeic acid primer mass-modified at C-5 of the heterocyclic base of a pyrlmldlne nucleoside 
with ^-alanine residues 

[0071 J Starting material was the same as in EXAMPLE 6. 0.281 g (1 .0 mmole) 5-(3-Aminopropynyl-1 )-2*-deoxyuri- 
dlne was reacted with N-Fmoc-p-alanlne pentafluorophenylester (0.955 g, 2.0 mmole) in 5 ml N.N-dimethytformamlde 
(DMF) in the presence of 0.129 g (174 ul; 1.0 mmole) N,N-disopropylethylamine for 60 mln at room temperature. 
Solvents were removed and the product purified by silica gel chromatography as described in EXAMPLE 6. Yield was 
0.425 g (0.74 mmole, 74 %). Another ^-alanine moiety can be added in exactly the same way after removal of the 
Fmoc group. The preparation of the 5'-0-dimethoxytritylated nucleoside-3'-0-^4yanoethyUN,N-diisopropylphosphp- 
amidite from 5-(3-(N-Fmoc-^-alanyl)-amidopropynyl-1)-2'-deoxyuridine and incorporation into automated oligonucle- 
otide synthesis is performed under standard conditions. This building block can substitute for any of the thymidine/ 
uridine residues in the nucleic acid primer sequence. In the case of only one incorporated mass-modified nucleotide, 
the nucleic acid primer molecules prepared according to EXAMPLES 6 and 7 would have a mass difference of 14 
daltons. 

EXAMPLE 8 

Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyclic base of a pyrlmldlne nucleoside 
with ethylene glycol monomethyl ether 

[0072] As a nucleoside component, 5-(3-aminopropynyl-1 )-2'-deoxyuridine was used in this example (see EXAM- 
PLES 6 and 7). The mass-modifying functionality was obtained as follows: 7.61 g (100.0 mmole) freshly distilled eth- 
ylene glycol monomethyl ether dissolved in 50 ml absolute pyridine was reacted with 10.01 g (100.0 mmole) recrys- 
tallized succinic anhydride in the presence of 1.22 g (10.0 mmole) 4-N,N-dlmethylaminopyridine overnight at room 
temperature. The reaction was terminated by the addition of water (5.0 ml), the reaction mixture evaporated in vacuo, 
co-evaporated twice with dry toluene (20 ml each) and the residue redissolved in 100 ml dichloromethane. The solution 
was extracted successively, twice with 1 0 % aqueous citric acid (2 x 20 mi) and once with water (20 ml) and the organic 
phase dried over anhydrous sodium sulfate. The organic phase was evaporated in vacuo, the residue redissolved in 
50 ml dichloromethane and precipitated into 500 ml pentane and the precipitate dried in vacuo. Yield was 13.12 g (74.0 
mmole; 74 %). 8.86 g (50.0 mmole) of succlnylated ethylene glycol monomethyl ether was dissolved in 100 ml dioxane 
containing 5% dry pyridine (5 ml) and 6.96»g (50.0 mmole) 4-nitrophenol and 10.32 g (50.0 mmole) dicyclohexylcar- 
bodiimide was added and the reaction run at room temperature for 4 hours. Dicydohexylurea was removed by filtration, 
the filtrate evaporated in vacuo and the residue redissolved In 50 ml anhydrous DMF. 12.5 ml (about 12.5 mmole 
4-nltropheny fester) of this solution was used to dissolve 2.81 g (10.0 mmole) 5-(3-amlnopropynyM)-2'-deoxyuridine. 
The reaction was performed In the presence of 1 .01 g (1 0.0 mmole; 1 .4 ml) triethylamine at room temperature overnight. 
The reaction mixture was evaporated in vacuo, co-evaporated with toluene, redissolved in dichloromethane and chro- 
matograprted on silicagel (Si60, Merck; column 4x50 cm) with dichloromethane/methanol mixtures. The fractions con- 
taining the desired compound were collected, evaporated, redissolved in 25 ml dichloromethane and precipitated Into 
250 ml pentane. The dried precipitate of 5-(3-N-(0-succinyl ethylene glycol monomethyl ether)-amldopropynyl-1 )-2*- 
deoxyuridine (yield: 65 %) is S'-O-dimethoxytritylated and transformed Into the nucteoside-3'-0-p-cyanoethy1-N,N-di- 
isopropylphosphoamidrte and incorporated as a building block in the automated oligonucleotide synthesis according 
to standard procedures. The mass-modified nucleotide can substitute for one or more of the thymidine/uridine residues 
in the nucleic acid primer sequence. Deprotection and purification of the primer oligonucleotide also follows standard 
procedures. 
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EXAMPLE 9 

Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyclic base of a pyrtmldlne nucleoside 
with dlethylene glycol monomethyl ether 

[0073) Nucleoside starting material was as in previous examples. 5-(3-aminopropynyl-1 J^-deoxyuridine. The mass* 
modifying functionality was obtained similar to EXAMPLE 8. 12.02 g (100.0 mmole) freshly distilled diethyJene glycol 
monomethyl ether dissolved in 50 ml absolute pyridine was reacted with 1 0.01 g (1 00.0 mmole) recrystalJized succinic 
anhydride In the presence of 1 22 g (10.0 mmole) 4-N, N-dimethylaminopyridine (DMAP) overnight at room tempera- 
ture. The work-up was as described in EXAMPLE 8. Yield was 18.35 g (82.3 mmole, 82.3 %). 11.06 g (50.0 mmole) 
of succinylated diethylene glycol monomethyl ether was transformed into the 4-nltrophenylester and, subsequently, 
12.5 mmole was reacted with 2.81 g (10.0 mmole)of5-(3-aminopropynyl-1)-2 > -deoxyuridlne as described in EXAMPLE 
8. Yield after silica gel column chromatography and precipitation into pentane was 3.34 g (6.9 mmole, 69 %). After 
dimethoxytritylation and transformation into the nucleoside-p-cyanoethylphosphoamidite, the mass-modified building 
block is incorporated into automated chemical DNA synthesis according to standard procedures. Within the sequence 
of the nucleic acid primer UP 0 *', one or more of the thymidine/uridine residues can be substituted by this mass-modified 
nucleotide. In the case of only one incorporated mass-modified nucleotide, the nucleic acid primers of EXAMPLES 8 
and 9 would have a mass difference of 44.05 daltons. 

EXAMPLE 10 

Synthesis of a nucleic acid primer mass-modified at C-8 of the heterocyclic base of deoxyadenosine with 
glycine 

[0074J Starting material was N 6 -benzoy^-B-bromo-5 , -0-(4,4 , -dimethoxytrityl)-2 , -deoxyadenosine prepared according 
to literature (Singh era/., Nucleic Acids Res. 18, 3339-45 (1990)). 632.5 mg <1.0 mmole) of this B-bromo-deoxyade- 
nosine derivative was suspended in 5 ml absolute ethanol and reacted with 251 .2 mg (2.0 mmole) glycine methyl ester 
(hydrochloride) in the presence of 241.4 mg (2.1 mmole; 366 ul) N, N-dllsopropyiethylamine and refluxed until the 
starting nucleosfdic material had disappeared (4-6 hours) as checked by thin layer chromatography (TLC). The solvent 
was evaporated and the residue purified by silica gel chromatography (column 2.5x50 cm) using solvent mixtures of 
chloroform/methanol containing 0.1 % pyridine. The product fractions were combined, the solvent evaporated, the 
fractions dissolved in 5 ml dichloromethane and precipitated into 100 ml pentane. Yield was 467 mg (0.76 mmole, 76 
%). Transformation into the corresponding nucleoside-p-cyanoethylphosphoamidite and Integration Into automated 
chemical DNA synthesis is performed under standard conditions. During final deprotection with aqueous concentrated 
ammonia, the methyl group is removed from the glycine moiety. The mass-modified building block can substitute one 
or more deoxyadenosine/adenosine residues in the nucleic acid primer sequence. 

EXAMPLE 11 

Synthesis of a nucleic acid primer mass-modified at C-8 of the heterocyclic base of deoxyadenosine with 
gfvcylglyclne 

[0075] This derivative was prepared in analogy to the glycine derivative of EXAMPLE 1 0. 632.5 mg (1 .0 mmole) N 6 - 
Benzoyl-e-bromo-S'-O ^^'-dimethoxytrltyO^'-deoxyadenosine was suspended in 5 ml absolute ethanol and reacted 
with 324.3 mg (2.0 mmole) glycyl-glycine methyl ester In the presence of 241 A mg (2. 1 mmole, 366 ul) N, N-diisopro- 
pylethylamine. The mixture was refluxed and completeness of the reaction checked by TLC. Work-up and purification 
was similar to that described In EXAMPLE 10. Yield after silica gel column chromatography and precipitation into 
pentane was 464 mg (0.65 mmole, 65 %). Transformation into the nucleoside-P-cyanoethylphosphoamidite and into 
synthetic oligonucleotides Is done according to standard procedures. In the case where only one of the deoxyadeno- 
sine/adenosine residues in the nucleic acid primer is substituted by this mass-modified nucleotide, the mass difference 
between the nucleic acid primers of EXAMPLES 10 and 11 is 57.03 daltons. 

EXAMPLE 12 

Synthesis of a nucleic acid primer mass-modified at the C-2* of the sugar moiety of r-amlno^-deoxythymldlne 
with ethylene glycol monomethyl ether residues 

[0076] Starting material was 5 , -O-(4,4-dimethoxytrity0-2 > -amino-2 , -deoxythymldlne synthesized according to pub- 
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lished procedures (e.g., Verheyden etaL. J. Org. Chem. 36, 250-254 (1 971); Sasaki era/., J. Org. Chem . 41 , 3138-3143 
(1976); Imazawa era/., J. Org. Chem. 44, 2039-2041 (1979); Hobbs era/., J. Org. Chem. 42, 714-719 (1976); Ikehara 
eta!., Chem. Pharm. Bull. Japan 26. 240-244 (1978); see also PCT Application WO 88/00201). 5'-0-(4,4-Dimethox- 
ytrityl)-2*-amino-2'-deoxythymidine (559.62 mg; 1 .0 mmole) was reacted with 2.0 mmole of the 4-nitrophenyl ester of 
succinylated ethylene glycol monomethyl ether (see EXAMPLE 8) in 10 ml dry DMF in the presence of 1 .0 mmole (140 
u.1) triethylamine for 1 8 hours at room temperature. The reaction mixture was evaporated In vacuo, co-evaporated with 
toluene, redissolved In dichloromethane and purified by silica gel chromatography (Si60, Merck; column: 2.5x50 cm; 
eluent: chloroform/methanol mixtures containing 0.1% triethylamine) . The product containing fractions were combined, 
evaporated and precipitated into pentane. Yield was 524 mg (0.73 mmol; 73 %). Transformation into the nucleoside- 
P-cyanoethyl-N.N-diisopropylphosphoamidite and incorporation into the automated chemical DNA synthesis protocol 
is performed by standard procedures. The mass-modified deoxythymidine derivative can substitute for one or more of 
the thymidine residues in the nucleic acid primer. 

[0077] In an analogous way, by employing the 4-nitrophenyl ester of succinylated diethylene gfycol monomethyl ether 
(see EXAMPLE 9) and Methylene glycol monomethyl ether, the corresponding mass-modified oligonucleotides are 
prepared. In the case of only one incorporated mass-modified nucleoside within the sequence, the mass difference 
between the ethylene, diethyiene and triethyJene glycol derivatives Is 44.05, 88.1 and 132.15 daltons respectively. 

EXAMPLE 13 

Synthesis of a nucleic acid primer mass-modified in the intemucleotidlc linkage via alkylation of 
phosphorothioate groups 

[0078] Phosphorothloate-containing oligonucleotides were prepared according to standard procedures (see e.g. Gait 
et Nucleic Acids Res. , 19 1183 (1991)). One, several or all intemucleotlde linkages can be modified in this way. 
The (-)-M13 nucleic acid primer sequence (17-mer) 5-dGTAAAACGACGGCCAGT was synthesized in 0.25 junole 
scale on a DNA synthesizer and one phosphorothioate group introduced after the final synthesis cycle (G to T coupling) . 
Sulfurization, deprotection and purification followed standard protocols. Yield was 31 .4 nmole (12.6 % overall yield), 
corresponding to 31 .4 nmole phosphorothioate groups. Alkylation was performed by dissolving the residue in 31 .4 u.l 
TE buffer (0.01 M Tris pH 8.0, 0.001 M EDTA) and by adding 16 uJ of a solution of 20 mM solution of 2-iodoethanol 
(320 nmole; i.e., 10-fold excess with respect to phosphorothioate diesters) in N,N-dimethylfonmamide (DMF). The 
alkylated oligonucleotide was purified by standard reversed phase HPLC (RP-1 8 Uftraphere, Beckman; column: 4.5 x 
250 mm; 100 mM triethylammonium acetate, pH 7.0 and a gradient of 5 to 40 % acetonrtrile). 

[0079] In a variation of this procedure, the nucleic acid primer containing one or more phosphorothioate phosphodi- 
ester bond is used in the Sanger sequencing reactions. The primer-extension products of the four sequencing reactions 
are purified as exemplified In EXAMPLES 1 - 4, cleaved off the solid support, lyophilized and dissolved in 4 u.l each of 
TE buffer pH 8.0 and aikylated by addition of 2 ul of a 20 mM solution of 2-iodoethanol in DMF. It is then analyzed by 
ES and/or MALDI mass spectrometry. 

[00S0J In an analogous way, employing instead of 2-iodoethanol, e.g., 3-iodopropanof, 4-iodobutanol mass-modified 
nucleic acid primer are obtained with a mass difference of 14.03, 28.06 and 42.03 daltons respectively compared to 
the unmodified phosphorothioate phosphodiester-contalning oligonucleotide. 

EXAMPLE 14 

Synthesis of 2'-amjno-2'-deoxyuridlne-5'-triphosphate and 3'-amlr>o-2\3'-dldeoxythymldine-5^trlphosphate 
mass- modi fled at the 2'- or 3'-amlno function with glycine or p-alanine residues 

[0081] Starting material was 2'-azido-2'-deoxy uridine prepared according to literature (Verheyden et a/., J. Org, 
Chem, 38, 250 (1971)), which was 4,4-dimethoxytrftylated at 5'-OH with 4,4-dimethoxytrityl chloride in pyridine and 
acetylated at3'-OH with acetic anhydride in a one-pot reaction using standard reaction conditions. With 191 mg (0.71 
mmole) 2 , -azido-2 , -deoxyuridlne as starting material, 396 mg (0.65 mmol, 90.8 %) S'-O ^^lmethoxytrrtyl^-O- 
acetyl-2*-azido-2*-deoxuridine was obtained after purification via silica gel chromatography. Reduction of the azWo 
group was performed using published conditions (Barta et ai, Tetrahedron 46 . 587-594 (1990)). Yield of 5'- 
0-(4,4-dlmethoxytrityl)-3 , -0-acetyl-2 , -amlno-2 , -deoxyuridine after silica gel chromatography was 288 mg (0.49 mmole; 
76 %). This protected ^-amlno-^-deoxyurldlne derivative (588 mg, 1.0 mmole) was reacted with 2 equivalents (927 
mg, 2.0 mmole) N-Fmoc-grycfne pentafluorophenyl ester in 10 ml dry DMF overnight at room temperature in the pres- 
ence of 1 .0 mmole (1 74 uJ) N,N-diisopropylethy famine. Solvents were removed by evaporation in vacuo and the residue 
purified by silica gel chromatography. Yield was 71 1 mg (0.71 mmole, 82 %). Detritylation was achieved by a one hour 
treatment with 80% aqueous acetic acid at room temperature. The residue was evaporated to dryness, co-evaporated 
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twice with toluene, suspended in 1 ml dry acetonitrile and S'-phosphory fated with POCI 3 according to literature 
(Yoshikawa eta/., Bull. Chem. Soc. Japan 42, 3505 (1 969) and Sowa et a!., Bull. Chem. Soc. Japan 48, 2084 (1975)) 
and directly transformed In a one-pot reaction to the S'-triphosphate using 3 ml of a 0.5 M solution (1 .5 mmole) tetra 
(tri-n-butylammonium) pyrophosphate in DMF according to literature (e.g. Seela et a/., Helvetica Chimica Acta 74. 
1 048 (1991)). The Fmoc and the S'-O-acetyl groups were removed by a one-hour treatment with concentrated aqueous 
ammonia at room temperature and the reaction mixture evaporated and lyophiiized. Purification also followed standard 
procedures by using anion-exchange chromatography on DEAE-Sephadex with a linear gradient of triethylammonium 
bicarbonate (0.1 M - 1.0 M). Triphosphate containing fractions (checked by thin layer chromatography on polyethyle- 
neimine cellulose plates) were collected, evaporated and lyophilized. Yield (by UV-absorbance of the uracil moiety) 
was 68% (0.48 mmole). 

[0082] A gfycyl-gtycine modified ^-amlno^'-deoxyuridine-S'-triphosphate was obtained by removing the Fmoc group 
from 5^O-(4,4-dimethoxy1rityl)-3 , <)-acetyl-2-N-(N-94luorenylmethyloxycarbonyl-glycy0 by 
a one-hour treatment with a 20% solution of piperidine in DMF at room temperature, evaporation of solvents, two-fold 
co-evaporation with toluene and subsequent condensation with N-Fmoc-grycine pentafluorophenyl ester. Starting with 
1.0 mmole of the 2 , -N-glycyl-2 , -amfno-2'-deoxyurldJne derivative and following the procedure described above, 0.72 
mmole (72%) of the corresponding ^-(N-glycyJ-glycylJ-^-amino^'-deoxyurldlne^'-triphosphate was obtained. 
[0083] Starting with 5 , -0-(4 l 4-dimethoxytrityl)-3 , -0-acetyl-2 , -amlno-2 , -deoxyuridine and coupling with N-Fmoc-0- 
alanine pentafluorophenyl ester, the corresponding ^-(N-p-alanyO^-amlno-^deoxyuridine-S'-triphosphate can be 
synthesized. These modified nucleoside triphosphates are incorporated during the Sanger DNA sequencing process 
in the primer-extension products. The mass difference between the glycine, 0-alanine and glycyt-gfycine mass-modified 
nucleosides is, per nucleotide incorporated, 58.06, 72.09 and 115.1 daltons respectively. 

[0084] When starting with 5'-0-(4 r 4-dimethoxytrityl)-3 , -amino-2',3 , -dideoxythymidine (obtained by published proce- 
dures, see EXAMPLE 12), the corresponding 3*-(N-glycyl)-3'-amino-/ ^-(-N-glycyl-glycylJ-S'-amino-/ and 3*-(N-p-ala- 
nyO-S'-amino-^.S'-dideoxythymidine-S'-triphosphates can be obtained. These mass-modified nucleoside triphos- 
phates serve as a terminating nucleotide unit in the Sanger DNA sequencing reactions providing a mass difference 
per terminated fragment of 58.06, 72.09 and 115.1 daltons respectively when used In the multiplexing sequencing 
mode. The mass-differentiated fragments can then be analyzed by ES and/or MALDI mass spectrometry. 

EXAMPLE 15 

Synthesis of deoxy u rid ine-S'-trlphosp hate mass-modified at C-5 of the heterocyclic base with glycine, gfycyl- 
glycine and 0-alanlnc residues. 

[0085] 0.281 g (1 .0 mmole) 5-(3-AminopropynyM)-2'-deoxyurldine (see EXAMPLE 6) was reacted with eitherO.927 
g (2.0 mmole) N-Fmoc-glyclne pentafluorophenylester or 0.955g (2.0 mmole) N-Fmoc-0-alanine pentafluorophenyl 
ester in 5 ml dry DMF in the presence of 0.129 g N, N-dllsopropylethylamine (174 ul, 1 .0 mmole) overnight at room 
temperature. Solvents were removed by evaporation in vacuo and the condensation products purified by flash chro- 
matography on silica gel (Still et aL, J. Org. Chem. 43 . 2923-2925 (1 978)). Yields were 476 mg (0.85 mmole: 85%) for 
the glycine and 436 mg (0.76 mmole; 76%) for the 0-alanine derivatives. For the synthesis of the glycyl-glycine deriv- 
ative, the Fmoc group of 1 .0 mmole Fmoc-gjlycine-deoxy uridine derivative was removed by one-hour treatment with 
20% piperidine in DMF at room temperature. Solvents were removed by evaporation in vacuo, the residue was co- 
evaporated twice with toluene and condensed with 0.927 g (2.0 mmole) N-Fmoc-grycine pentafluorophenyl ester and 
purified as described above. Yield was 445 mg (0.72 mmole; 72%). The glycyl-, glycyl-glycyl- and 0-alanyl-2'-deoxy- 
uridine derivatives, N-protected with the Fmoc group were transformed to the 3'-0-acetyl derivatives by tritylatlon with 
4,4-dimethoxytrityl chloride in pyridine and acetyiation with acetic anhydride in pyridine in a one-pot reaction and sub- 
sequently detritylated by one hour treatment with 80% aqueous acetic acid according to standard procedures. Solvents 
were removed, the residues dissolved in 100 ml chloroform and extracted twice with 50 ml 10% sodium bicarbonate 
and once with 50 ml water, dried with sodium sulfate, the solvent evaporated and the residues purified by flash chro- 
matography on silica gel. Yields were 361 mg (0.60 mmole; 71%) for the glycyl-. 351 mg (0.57 mmole; 75%) for the 0- 
alanyl- and 323 mg (0.49 mmole; 68%) for the grycyl-grycyl-3-0'-acetyl-2 , -deoxyuridlne derivatives respectively. Phos- 
phorylation at the 5'-OH with POCI3, transformation into the S'-triphosphate by in-sJtu reaction with tetra(tri-n-buty!am- 
monium) pyrophosphate in DMF, 3*-de-0-acetylation, cleavage of the Fmoc group, and final purification by anion- 
exchange chromatography on DEAE-Sephadex was performed as described In EXAMPLE 14. Yields according to UV- 
absorbance of the uracil moiety were 0.41 mmole 5-(3-(N-glycyl)-amidopropynyl-1)-2 , -deoxyuridine-5 , -trlphosphate 
(84%), 0.43 mmole 5-(3-(N-0-alanyl)-amidopropynyl-1)-2"-deoxyu rid ine-5Mrtphosp hate (75%) and 0.38 mmole 
5-{3-(N-glycyl-glycyl)-amidopropynyM )-2'-deoxyuridine-5 , -triphosphate (78%). 

[0086] These mass-modified nucleoside triphosphates were incorporated during the Sanger DNA sequencing primer- 
extension reactions. 
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[0087] When using S-fS-amlnopropynyMJ^S'-dldeoxyuridine as starting material and following an analogous re- 
action sequence the corresponding glycyl-, glycyl-grycyl-and p-alanyl^'.^-dideoxyuridine-^-triphosphates were ob- 
tained in yields of 69, 63 and 71% respectively. These mass-modified nucleoside triphosphates serve as chain-termi- 
nating nucleotides during the Sanger DNA sequencing reactions. The mass-modified sequencing ladders are analyzed 
by either ES or MALDI mass spectrometry. 

EXAMPLE 16 

Synthesis of 8-grycy I- and 8-glycy l-glycyl-2'-deoxyadeno8tne-5'-triph08phate 

[0068} 727 mg (1 .0 mmole) of N e -(4-tert-butylphenoxyacetyl)-8-g^ycyl-6 -(4 > 4Hdimethoxytrrtyl)-2 , • deoxyadenosine or 
800 mg (1.0 mmole) N 6 -(4-tert-butylphenoxyacetyl)-8-gfycyl-g^ pre- 
pared according to EXAMPLES 10 and 11 and literature (Koster et a/., Tetrahedron 37, 362 (1981)) were acetylated 
with acetic anhydride In pyridine at the 3-OH, detritylated at the 5' -position with 80% acetic acid in a one-pot reaction 
and transformed into the S'-triphosphates via phosphorylation with POCI 3 and reaction in-situ with tetra(trl-n-butylam- 
monium) pyrophosphate as described in EXAMPLE 14. Deprotection of the NS-tert-butylphenoxyacetyf, the 3-O-acetyl 
and the O-methyl group at the glycine residues was achieved with concentrated aqueous ammonia for ninety minutes 
at room temperature. Ammonia was removed by lyophilization and the residue washed with dichloromethane, solvent 
removed by evaporation in vacuo and the remaining solid material purified by an ion -exchange chromatography on 
DEAE-Sephadex using a linear gradient of triethylammonlum bicarbonate from 0.1 to 1 .0 M. The nucleoside triphos- 
phate containing fractions (checked by TLC on poryethylenelmine cellulose plates) were combined and tyophillized. 
Yield of the S-glycyl-^-deoxyadenosine-S'-triphosphate (determined by UV-absorbance of the adenine moiety) was 
57% (0.57 mmole). The yield for the 8-glycy1-g!ycyi-2*-deoxyadenosine-5Mriphosphate was 51% (0.51 mmole). 
[0089] These mass-modified nucleoside triphosphates were incorporated during primer-extension in the Sanger DNA 
sequencing reactions. 

[0090] When using the corresponding N6-(4-tert-buty!phenoxyacetyl)-B-glycyl- or -glycyl- grycyl-5'-0-(4,4-dimethox- 
ytrityO-^.S'-dideoxyadenosine derivatives as starting materials prepared according to standard procedures (see, e.g., 
for the introduction of the 2\3'-f unction: Seela et a/., Helvetica Chimica Acta 74. 1048-1058 (1991)) and using an 
analogous reaction sequence as described above, the chain-terminating mass-modified nucleoside triphosphates 
8-glycyh and S-glycyl-glycy^'.S'-dideoxyadenosine-S'-tnphosphates were obtained in 53 and 47% yields respectively. 
The mass-modified sequencing fragment ladders are analyzed by either ES or MALDI mass spectrometry. 

EXAMPLE 17 

Mass-modification of Sanger DNA sequencing fragment ladders by Incorporation of chaln-elongatlng 2'-deoxy- 
and chain-terminating 2',3 , -dideoxythymldlne-5'-(alpha-S-)-trlphosphate and subsequent alleviation with 
2-lodoethanol and 3-iodopropano) 

[0091] 2',3 , -Dideoxythymldlne-5 , -(alpha-S)-triphosphate was prepared according to published procedures (e.g., for 
the aipha-S-triphosphate moiety: Eckstein et at., Biochemistry 15, 1685 (1976) and Accounts Chem. Res. 12, 204 
(1978) and for the 2\3'-dideoxy moiety: Seela et ai. t Helvetica Chimica Acta , 74, 1048-1058 (1991)). Sanger DNA 
sequencing reactions employing ^-deoxythymfdlne-S'-falpha-SJ-triphosphate are performed according to standard 
protocols (e.g. Eckstein, Ann. Rev. Biochem. 54. 367 (1 985)). When using 2 , ,3 , -dideoxythymidine-5'-(alpha-SHn|3hos- 
phates, this is used instead of the unmodified ^.S'-dideoxythymidine-S'-triphosphate In standard Sanger DNA sequenc- 
ing (see e.g. Swerdlow et al. % Nucleic Acids Res. 18. 1415-1419 (1990)). The template (2 pmole) and the nucleic acid 
M1 3 sequencing primer (4 pmole) modified according to EXAMPLE 1 are annealed by heating to 65°C in 100 ul of 10 
mM Tris-HCI pH 7.5, 1 0 mM MgCI 2 , 50 mM NaCI, 7 mM dithiothneitol(DTT) for 5 min and slowly brought to 37°C during 
a one hour period. The sequencing reaction mixtures contain, as exemplified for the T-specrhc termination reaction, in 
a final volume of 150 ul, 200 uM (final concentration) each of dATP, dCTP, dTTP, 300 uM c7-deaza-dGTP, 5 uM 2\3 # - 
dideoxythymidine-5 , -(alpha-S)-triphosphate and 40 units Sequenase (United States Biochemlcals). Polymerization is 
performed for 1 0 min at 37°C, the reaction mixture heated to 70°C to Inactivate the Sequenase, ethanol precipitated 
and coupled to thlolated Sequelon membrane disks (8 mm diameter) as described in EXAMPLE 1 . Alkylation is per- 
formed by treating the disks with 10 ul of 1 0 mM solution of either 2-iodoethanol or 3-lodopropanoI In NMM (N-meth- 
yimoroholine/watery2-propanol, 2/49/49, v/v/v) (three times), washing with 10 ul NMM (three times) and cleaving the 
alkylated T-terminated primer-extension products off the support by treatment with DTT as described in EXAMPLE 1. 
Analysis of the mass-modified fragment families is performed with either ES or MALDI mass spectrometry. 
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EXAMPLE 18 



Analysis of a Mixture of OllgothymldyNc Acids 

[0092] Oligothymfcrylic acid, oiigo p(dT) 12 . 18> is commercially available (United States Biochemical, Cleveland, OH). 
Generally, a matrix solution of 0.5 M in ethanol was prepared. Various matrices were used for this Example and Ex- 
amples 19- 21 such as 3,5-dihydroxybenzoic acid, sinaplnfc acid, 3-hydroxypicolinic acid, 2,4,6-trihydroxyacetophe- 
none. Oligonucleotides were lyophillzed after purification by HPLC and taken up in ultrapure water (MllliQ, MHIipore) 
using amounts to obtain a concentration of 10 pmoles/u.1 as stock solution. An aliquot (1 u.l) of this concentration or a 
dilution In ultrapure water was mixed with 1 uJ of the matrix solution on a flat metal surface serving as the probe tip 
and dried with a fan using cold air. In some experiments, cation-Ion exchange beads in the acid form were added to 
the mixture of matrix and sample solution. 

[0093] MALDI-TOF spectra were obtained for this Example and Examples 19-21 on different commercial instruments 
such as Vision 2000 (Finnigan-MAT), VG TofSpec (Fisons Instruments), LaserTec Research (Vestec). The conditions 
for this Example were linear negative ion mode with an acceleration voftage of 25 kV. The MALDI-TOF spectrum 
generated is shown In FIGURE 14. Mass calibration was done externally and generally achieved by usfng defined 
peptides of appropriate mass range such as insulin, gramicidin S. trypslnogen, bovine serum albumen, and cytochrome 
C. All spectra were generated by employing a nitrogen laser with 5 nsec pulses at a wavelength of 337 nm. Laser 
energy varied between 10 6 and 10 7 W/cm*. To Improve signaMo-nolse ratio generally, the intensities of 10 to 30 laser 
shots were accumulated. 



EXAMPLE 19 



Mass Spectrometry Analysis of a 50-mer and a 99-mer 



[0094] Two large oligonucleotides were analyzed by mass spectrometry. The 50-mer d 
(TAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCT) (SEQ ID NO:3) and dT(paT) 99 were 
used. The oligodeoxynucleotides were synthesized using p -cyanoethylphosphoamidites and purified using published 
procedures.(e.g. N.D. Slnha, J. Biernat, J. McManus and H. Koster, Nucleic Acids Res. , 12, 4539 (1984)) employing 
commercially available DNA synthesizers from either Millipore (Bedford, MA) or Applied Btosystems (Foster City CA) 
and HPLC equrpment and RP18 reverse phase columns from Waters (Milford, MA). The samples for mass spetfro- 
metnc analysis were prepared as described In Example 18. The conditions used for MALDI-MS analysis of each oli- 
gonucleotide were 500 fmol of each oligonucleotide, reflectron positive ion mode with an acceleration of 5 kV and 
postacceleration of 20 kV. The MALDI-TOF spectra generated were superimposed and are shown in FIGURE 15. 

EXAMPLE 20 



Simulation of the DNA Sequencing Results of FIGURE 2 



[0095] Tne 1 3 DNA sequences representing the nested dT-termlnated fragments of the Sanger DNA sequencing for 

he S^merdescribed in Example 1 9 (SEQ ID NO:3) were synthesized as described in Example 1 9. The samples were 

™ ^ ea ° h fra 9 ment was analyzed by MALDI-MS as described in Example 18.. The resulting MAL- 

Z I?u ^ Sh ° Wn in FIGU RES 1 6A ~ 1 6M - The were reflectron positive ion mode with an acceleration 

of 5 kV and postacceleration of 20 kV. Calculated molecular masses and experimental molecular masses are shown 



[0096] The MALDI-TOF spectra were superimposed (FIGURES 17A and 17B) to demonstrate that the individual 
peaks are resolvable even between the 1 0-mer and 1 1 -mer (upper panel) and the 37-mer and 38-mer (lower panel) 
The two panels show two different scales and the spectra analyzed at that scale 



EXAMPLE 21 



MALDI-MS Analysis of a Mass-Modified Oligonucleotide 



5°^ c. /J 7 JT W T ma8Mnodlfted at c " 5 <> f one or two deoxyuridlne moieties. 5-[1 3-<2-Methoxyethoxyl)-trkJecyne- 
J ] t~ ( it ^ ^^^^O^-deoxyuridine-S'^-cyanoethyl-N, N-diisopropylphosphoamldite was used to synthe- 
size the modified 1 7-mers using the methods described in Example 19 
[0098] The modified 1 7-mers were 
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f 

a: d (TAAAACGACGGCCAGUG) (molecular mass: 5454) 
(SEQ ID NO:4) 

f T 

b: d (UAAAACGACGGCCAGUG) (molecular mass 5634) 
(SEQIDNO:5) 

where X = -CsC^CH^i i -OH 

(unmodified 17-men molecular mass: 5273) 

[0099] The samples were prepared and 500 fmol of each modified 17-mer was analyzed using MALDI-MS as de- 
scribed in Example 1 8. The conditions used were reflectron positive ion mode with an acceleration of 5 kV and postac- 
celeration of 20 kv\ The MALDI-TOF spectra which were generated were superimposed and are shown in FIGURE 18. 
[0100] AM of the above-cited references and publications are hereby incorporated by reference. 

EQUIVALENTS 

[01011 Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, 
numerous equivalents to the specific procedures described herein. Such equivalents are considered to be within the 
scope of this invention and are covered by the following claims. 
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SEQUENCE LISTING 



<1> GENERAL INFORMATION : 

(i) APPLICANT : 

(A) NAME: ROSTER, HUBERT 

(B) STREET: 1640 MONUMENT STREET 
(C> CITY: CONCORD 

<D) STATE: MASSACHUSETTS 

<E> COUNTRY: USA 

(F) POSTAL CODB (ZIP) : 01742 

(G) TELEPHONE: (508) 369-9790 

(ii) TITLE OF INVENTION: DNA SEQUENCING BY MASS SPECTROMETRY 
(iii) NUMBER OF SEQUENCES: S 

<V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

<C) OPERATING SYSTEM: PC-DOS /MS -DOS 
(D) SOFTWARE: ASCII (text) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE i 06-JAN-1994 
CO CLASSIFICATION: 

(vii) PRIOR APPLICATION nATA: 

(A) APPLICATION NUMBER: US 08/001,323 
<B) FILING DATE: 07- JAN- 1993 

(C) CLASSIFICATION: IB 07 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: DeConti, Giulio A. 

(B) REGISTRATION NUMBER: 31,503 

(C) REFERENCE/DOCKET NUMBER: HXI-003CP 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 227-7400 

(B) TELEFAX; (617) 227-5941 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : Single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL; YES 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO:l: 
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CATGCCATGG CATG 

(2) INFORMATION FOR SEQ IP NO: 2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
Uii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 : 

AAATTGTGCA CATCCTOCAG C 

<2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 50 base pairs 
<B) TYPE : nucleic acid 
(C) STRANDEDNES5 : single 
<D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: other nucleic acid 

Uii> HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 
TAACGGTCAT TACGGCCATT GACTGTAGGA CCTGCATTAC ATGACTAGCT 



<2) INFORMATION FOR SEQ ID NO:4: 

Ci) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 17 base pairs 
(6) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPB: other nucleic acid 
(iii) HYPOTHETICAL: YES 



« 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
TAAAACGACG GGCCAGXG 
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[2) INFORMATION FOR SBQ ID NO: 5 

■ 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii> HYPOTHETICAL: YES 



(Xi) SEQUENCE DESCRIPTION : SBQ ID NO: 5: 
XAAAACGACG 6GCCAGXG 



Claims 

1. A method for sequencing two or more nucleic acids, comprising: 

generating base-specif ica My terminated nucleic acid fragments from each of the nucleic acids to be sequenced; 
determining the molecular weight of each base-specifically terminated fragment by mass spectrometry; 
and determining the sequences of the nucleic acids by aligning the base-specifically terminated nucleic acid 
fragments according to molecular weight- 
wherein: 

the two or more nucleic acids are sequenced concurrently; and 

the base-specifically terminated nucleic acid fragments generated from one nucleic acid can be differentiated 
from the base-specifically terminated nucleic acid fragments generated from each of the other nucleic acids 
by molecular weight. 

2. The method of claim 1 , wherein base-specifically terminated nucleic acid fragments generated from one or more 
of the nucleic acids are mass modified. 

3. The method of claim 2, wherein the mass-modified base-specifically terminated nucleic acid fragments are modified 
with a mass-modifying functionality (M) according to one or more of the following: 

(a) a mass-modifying functionality (M) that is at a heterocyclic base of at least one nucleotide; 

(b) a mass-modifying functionality (M) attached to the phosphate backbone; and 

(c) a mass-modifying functionality (M) attached to one or more sugar moieties in at least one sugar position 
selected from the group consisting of an internal Opposition, an external C-2* position, and an external C-5* 
position. 

4. The method of claim 3, wherein the heterocyclic base-modified nucleotide is selected from the group consisting 
of a cytosine nucleotide modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide modified at 
the C-5 methyl group, a uracil nucleotide modified at C-5, an adenine nucleotide modified at C-8, a c 7 -deazaadenine 
modified at C-7, a guanine nucleotide modified at C-8, a c 7 -deazaguanlne modified at C-8, a c 7 -deazaadenine 
modified at C-B, a c 7 -deazaguanfne modified at C-7, a hypoxanthine modified at C-8, a c 7 -deazahypoxanthine 
modified at C-7, and a c 7 -deazahypoxanthine modified at C-8. 

5. The method of claim 1 or 2, wherein each base-specifically terminated nucleic acid fragment is coupled by a linking 
group (L) to a functionality (L*) on a solid support. 
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6. The method of claim 5, wherein the coupling of each base-specifically terminated nucleic acid fragment to the solid 
support is reversible. 

7. The method of claim 5 or 6, wherein the mass-modified base- specifically terminated nucleic acid fragments are 
modified with a mass-modifying functionality (M) attached to the sugar moiety of a S'-terminal nucleotide and where- 
in the mass-modifying function (M) is a linking functionality (L). 

8. The method of claim 6, wherein the base-specifically terminated nucleic acid fragments are cleaved from the solid 
support prior to or during mass spectrometry. 

9. The method of claim 6, wherein the base-specifically terminated nucleic acid fragments are cleaved from the solid 
support enzymatfcally, chemically or physically. 

10. The method of claim 6, wherein the coupling of the base-specifically terminated nucleic acid fragments to the solid 
support is selected from the group consisting of a photodeavable bond, a bond based on strong electrostatic 
interaction, a trityletherbond, a p-benzoylpropionyl group, a levulinyl group, a disulfide bond, an arglnlne/arginine 
bond, a lysine/lysine bond, a pyrophosphate bond, and a bond created by Watson-Crick base pairing. 

11. The method of claim 2, wherein the mass-modified base-specifically terminated nucleic acid fragments are modified 
with a mass- modifying functionality (M) which is attached to the base-specifically terminated nucleic acid fragments 
subsequent to generation of the base-specif icalfy terminated fragments and prior to determining the molecular 
weight of the fragments by mass spectrometry. 

12. The method of 11 , wherein the generation of the base-specifically terminated fragments is performed by using at 
least one reagent selected from the group consisting of a nucleic acid primer, a chain-elongating nucleotide, a 
chain-terminating nucleotide and a tag probe which has been modified with a precursor of the mass-modifying 
functionality, M, and a subsequent step comprises modifying the precursor of the mass-modifying functionality, M, 
to generate the mass-modifying functionality, M, prior to mass spectrometry analysis. 

13. The method of claim 1 , wherein the base-specifically terminated nucleic acid fragments from each of the nucleic 
acids to be sequenced are generated by synthesizing nucleic acids complementary to the nucleic acids to be 
sequenced starting from a nucleic acid primer and in the presence of chain-terminating and chain-elongating nu- 
cleotides. , 

14. The method of claim 13, wherein at least one of the nucleic acid primer, a chain-elongating nucleotide, and a chain- 
terminating nucleotide is mass-modified. 

15. The method of claim 13 or claim 14, wherein the primer is reverslbly linked to a solid support through a linking 
group and the fragments are cleaved from the solid support by a laser during mass spectrometry. 

16. The method of claim 1 , further comprising after the step of determining the molecular weight of each base-specif- 
Icalry terminated fragment by mass spectrometry: 

hybridizing the base-specifically terminated nucleic acid fragments with one or more tag probes; 
determining the molecular weight of each of the base-specifically terminated nucleic acids; and 
comparing the molecular weights of the base-specif teal ry terminated nucleic acids before and after hybridiza- 
tion to the tag probe(s); 

wherein base-specifically terminated nucleic acid fragments generated from one or more of the nucleic acids 
to be sequenced comprise a tag sequence which specifically hybridizes to a tag probe; and 
the tag probes are differentiated by molecular weight. 

17. The method of claim 1 6, wherein the tag probe(s) are covalently bound to the corresponding tag sequence(s) prior 
to mass spectrometric analysis. 

18. The method of claim 17, wherein binding between the tag probe(s) and the corresponding tag sequence(s) is 
achieved photochemicalry via photoactivatabie groups. 
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19. The method of claim 16, wherein mass differentiation of the tag probes is achieved by changing the nucleotide 
composition of at least one of the tag probes and complementary tag sequence in a base-specifically terminated 
nucleic acid. 

20. The method of claim 16, wherein mass differentiation of the tag probes is achieved by mass modification of one 
or more tag probes. 

21 . The method of claim 1 3, wherein at least one of the nucleic acid primer, a chain-elongating nucleotide, and a chain- 
terminating nucleotide comprises a modified nucleotide. 

22. The method of claim 21 , wherein the modified nucleotide is a phosphorothioate nucleotide. 

23. The method of claim 22, wherein the phosphorothioate nucleotide is an alkylated phosphorothioate nucleotide. 

24. The method of claim 5, wherein the coupling of the base-specifically terminated nucleic acid fragments to the solid 
support is effected by a bond cleavable by a pyrophosphatase. 

25. The method of claim 5, further comprising purifying the base-specifically terminated nucleic acid fragments by 
washing out remaining reactants and by-products. 

26. The method of claim 1 , wherein a counter-ion of the phosphate backbone of the base-specifically terminated nucleic 
acid fragments is removed or is exchanged with a second counter-ion. 

27. The method of claim 1 , wherein the molecular weight of each fragment is determined by matrix-assisted laser 
desorption/ionization mass spectrometry (MALDI-MS) or eJectrospray mass spectrometry (ES-MS). 

28. A kit for sequencing two or more species of nucleic acids by multiplex mass spectrometric nucleic acid sequencing, 
comprising: 

a) a solid support having a linking functionality (L^; 

b) nucleic acid primers suitable for initiating synthesis of a set of nucleic acids which are complementary to 
the different species of nucleic acids, the primers each including a linking group (L) able to interact with the 
linking functionality (L 1 ) and reversibly link the primers to the solid support and optionally, a tag probe; 

c) chain-elongating nucleotides for synthesizing the complementary nucleic acids; and 

d) chain-terminating nucleotides for terminating synthesis of the complementary nucleic acids and generating 
sets of base-specif lea try terminated complementary nucleic acid fragments, 

wherein in the absence of a tag probe, at least one reagent selected from the group consisting of the primers, 
the chain-elongating nucleotides, and the chain-terminating nucleotides is mass modified to provide distinction 
between each set of base-specifically terminated nucleic acid fragments of each species of nucleic acid by mass 
spectrometry. 

29. The kit of claim 28, wherein the chain-efongating nucleotides comprise at least one nucleotide selected from the 
group consisting of deoxyadenosine triphosphate (dATP) deoxythymldine triphosphate (oTTP), deoxyguanosine 
triphosphate (dGTP), deoxycytidine triphosphate (DCTP), deoxyinosine triphosphate (diTP), 7-deaza deoxyade- 
nosine triphosphate (c 7 dATP), 7-deaza deoxythymldine triphosphate (C 7 TTP). 7-deaza deoxyguanosine triphos- 
phate (c 7 dGTP), 7-deaza deoxycytidine triphosphate (c 7 dCTP) and 7-deaza deoxyinosine triphosphate (c 7 dlTP). 

30. The kit of claim 28, wherein the chain-terminating nucleotides comprise at least one nucleotide selected from the 
group consisting of dkleoxyadenosine triphosphate (ddATP), dldeoxythymidine triphosphate (ddTTP), dldeoxy- 
guanoslne triphosphate (ddGTP), dideoxycytidine triphosphate (ddCTP), 7-deaza dideoxyguanoslne triphosphate 
(c 7 ddGTP), 7-deaza dideoxyadenosine triphosphate (c 7 ddATP), 7-deaza dideoxyinosine triphosphate (c 7 ddlTP). 

31. The kit of claim 28, wherein the chain-elongating nucleotides comprise at least one nucleotide selected from the 
group consisting of adenosine triphosphate (ATP), uridine triphosphate (UTP), guanosine triphosphate (GTP), 
cytidine triphosphate (CTP), inosine triphosphate (ITP), 7-deaza adenosine triphosphate (c 7 ATP), 7-deaza gua- 
nosine triphosphate (c 7 GTP), and 7-deaza inosine triphosphate (c 7 ITP). 
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32. The kit of claim 28, wherein the chain-terminating nucleotides comprise at least one nucleotide selected from the 
group consisting of deoxyadenosine triphosphate (3'-dATP) t deoxyuridine triphosphate (3'-dUTP), deoxyguanos- 
ine triphosphate (3'-dGTP), deoxycytidine triphosphate (S'-dCTP), 7-deaza 3'deoxy adenosine triphosphate (c 7 
-3'dATP), 7-deaza 3'deoxyguanosine triphosphate (c 7 -3'dGTP) and 7-deaza 3'deoxyinosine (c 7 -3'dlTP), 

33. The kit of claim 28, wherein the linkage between the linking group (L) and the linking functionality (L*) is selected 
from the group consisting of a photocleavabJe bond, a tritylether bond, a p-benzoylpropionyl group, a levulinyl 
group, a disulfide bond, an arginine/arginine bond, a lysine/lysine bond, and a pyrophosphate bond and a bond 
created by Watson-Crick base pairing. 

34. The kit of claim 28, wherein the mass modified reagent is modified with a mass-modifying functionality (M) according 
to one or more of the following: 

(a) a mass-modifying functionality (M) that is at a heterocyclic base of at Jeast one nucleotide; 

(b) a mass-modifying functionality (M) that, when incorporated into a base-specifically terminated nucleic acid 
fragment, is attached to the phosphate backbone; and 

(c) a mass-modifying functionality (M) attached to one or more sugar moieties in at least one sugar position 
selected from the group consisting of a C-2* position, an external C-3* position, and an external C-5* position. 

35. The kit of claim 34, wherein the heterocyclic base-modified nucleotide is selected from the group consisting of a 
cytosine nucleotide modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide modified at the 
C-5 methyl group, a uracil nucleotide modified at C-5, an adenine nucleotide modified at C-8, an adenine nucleotide 
modified at C-7, a c 7 -deazaadenine modified at C-8, a c 7 -deazaadenine modified at C-7, a guanine nucleotide 
modified at C-8 f a guanine nucleotide modified at C-7, a c^-deazaguanine modified at C-8, a c 7 -deazaguanine 
modified at C-7, a hypoxanthfne modified at C-8, a c 7 -deazahypoxanthine modified at C-7, and a c 7 -deazahypox- 
anthlne modified at C-B. 

36. The kit of claim 28, wherein the mass modified reagent is modified with a mass-modifying functionality (M) attached 
to the sugar moiety of a S'-terminal nucleotide and wherein the mass-modifying function (M) is the linking func- 
tionality (L). 

37. The kit of claim 28, wherein the primer or the tag probe comprises a deoxy ribonucleotide sefected from the group 
consisting of: a 7-deaza deoxyadenosine triphosphate, (c 7 dA), a 7-deaza deoxyguanosine triphosphate (c 7 dG) 
and a 7-deaza deoxyinosine triphosphate (c 7 dl). 

38. The kit of claim 28, wherein the primer or the tag probe comprises a ribonucleotide selected from the group con- 
sisting of: 7-deaza adenine (c 7 A), 7-deaza guanine (c 7 G) and 7-deaza inosine (c 7 1). 

39. A kit for sequencing a nucleic acid by mass spectrometry, comprising: 

a) a solid support having a linking functionality (L'); 

b) one or more nucleic acid primers suitable for Initiating synthesis of complementary nucleic acids which are 
complementary to the nucleic acid to be sequenced, the primers each including a linking group (L) able to 
interact with the linking functionality (U) and reversibry immobilize the primers on the solid support; 

c) chain-elongating nucleotides for synthesizing the complementary nucleic acids; and 

d) chain-terminating nucleotides for terminating synthesis of the complementary nucleic acids and generating 
sets of base-specifically terminated complementary nucleic acid fragments, 

wherein the chain-terminating nucleotides are mass-modified so that addition of one chain-terminating nu- 
cleotide to the complementary nucleic acid can be distinguished by mass spectrometry from addition of all other 
chain-terminating nucleotides concurrently analysed. 

40. The kit of claim 39, wherein the chaln-eiongatlng nucleotides comprise at least one nucleotide selected from the 
group consisting of deoxyadenosine triphosphate (dATP) deoxythymidine triphosphate (dTTP), deoxyguanosine 
triphosphate (dGTP), deoxycyudlne triphosphate (dCTP). deoxyinosine triphosphate (dITP), a 7-deazadeoxygua- 
nosine triphosphate (c 7 dGTP), a 7-deazadeoxyadenosine triphosphate (c 7 dATP), and a 7-deazadeoxylnosine 
triphosphate (c 7 dlTP). 
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41. The kit of claim 39, wherein the chain-terminating nucleotides comprise at least one nucleotide selected from the 
group consisting of dideoxyadenosine triphosphate (ddATP), dideoxythymidine triphosphate (ddTTP), dideoxy- 
guanosine triphosphate (ddGTP), dldeoxycytidine triphosphate (ddCTP), 7-deazadideoxyguanoslne triphosphate 
(c 7 ddGTP), 7-deazadideoxy adenosine triphosphate (Cj ddATP), 7-deazadideoxy inosine triphosphate (c 7 ddlTP). 

5 

42. The kit of claim 39, wherein the chain -elongating nucleotides comprise a nucleotide selected from the group con- 
sisting of adenosine triphosphate (ATP), uridine triphosphate (UTP), guanoslne triphosphate (GTP), cytidine tri- 
phosphate (CTP), inosine triphosphate (ITP), a 7-deazaadenosine triphosphate (c 7 ATP), a 7-deazaguanosine 
triphosphate (c 7 GTP), and a 7-deazainosine triphosphate (c 7 ITP). 

10 

43. The kit of claim 39, wherein the chain-terminating nucleotides comprise at least one nucleotide selected from the 
group consisting of deoxyadenosine triphosphate (3*-dATP), deoxyuridine triphosphate (3'-dUTP), deoxyguanos- 
ine triphosphate (3*-dGTP), deoxycytidine triphosphate (3*-dCTP), 7-deaza 3'deoxyadenosine triphosphate (c 7 
-3'dATP). 7-deaza 3'deoxyguanosine triphosphate (c 7 -3'dGTP) and 7-deaza 3'deoxy inosine triphosphate (c 7 

'5 -3'dlTP). 

44. The kit of claim 39, wherein the linkage between the linking group (L) and the linking functionality (L') is selected 
from the group consisting of a photocleavable bond, a tritylether bond, a p-benzoytpropionyl group, a levulinyl 
group, a disulfide bond, an arginine/arginine bond, a lysine/lysine bond, and a pyrophosphate bond and a bond 

20 created by Watson-Crick base pairing. 

45. The kit of claim 39, wherein the mass modified chain-terminating nucleotide is modified according to one or more 
of the following: 



25 (a) a mass-modifying functionality (M) that is at a heterocyclic base; 

(b) a mass-modifying functionality (M) that, when incorporated into a base-specifically terminated nucleic acid 
fragment, is attached to the phosphate backbone; and 

(c) a mass-modifying functionality (M) attached to one or more sugar moieties in at least one sugar position 
selected from the group consisting of a C-2' position, an external C-3' position, and an external C-5' position. 

30 

46. The kit of claim 45, wherein the heterocyclic base-modified nucleotide is selected from the group consisting of a 
cytosine nucleotide modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide modified at the 
C-5 methyl group, a uracil nucleotide modified at C-5, an adenine nucleotide modified at C-8, an adenine nucleotide 
modified at C-7, a c 7 -deazaadenine modified at C-8, a c 7 -deazaadenine modified at C-7, a guanine nucleotide 
55 modified at C-8, a guanine nucleotide modified at C-7, a c 7 -deazaguanlne modified at C-8, a c 7 -deazaguanine 

modified at C-7, a hypoxanthine modified at C-8, a c 7 -deazahypoxanthine modified at C-7, and a c 7 -deazahypox- 
anthine modified at C-8. 



47. An Intact ionized and volatilized mass-modified nucleic acid molecule, comprising at least two mass-modified nu- 
cleotides, wherein the molecule is positively charged. 

48. An intact ionized mass-modified nucleic acid molecule of claim 47, comprising at least two mass-modified nucle- 
otides containing a mass-modifying functionality (M) attached to a heterocyclic base of the nucleotide. 

49. An intact ionized mass-modified nucleic acid molecule of claim 47, comprising at least two mass-modified nucle- 
otides containing a mass-modifying functionality (M) attached to at least one phosphorus of the nucleotide. 



50. An Ionized intact mass-modified nucleic acid molecule of claim 47, wherein a mass-modifying functionality (M) 
incorporated into the molecule is XR, wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC(S)-, 
-OCO(CH 2 ) r COO- (where r=1 -20), -NHCOtCH^OO- (where r=1 -20), -OSO^- and R Is selected from the group 
consisting of H, methyl, ethyl, propyl, tsopropyl, t-butyl, hexyl, benzyl, benzhydryl, halogen, trityl, substituted trttyt, 
aryl, substituted an/I, polyoxymethylene, monoalkylated polyoxymethylene, a polyethylene Imine, -<NH(CH2) r NH- 
CO(CH 2 ) f CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, -(NH(CH 2 ) r CO-) m -fMH-(CH 2 ) r -COOH, -(OJCK^CO-)^ 
0-(CH 2 ) r -COOH, -SI(Y) 3 , -(NHCHaaCOOH), -(CH 2 CH a O) m -CH 2 CH20H, and -<CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where 
m is in the range of 0 to 200, Y is a lower alkyl group selected from a group consisting of methyl, ethyl, propyl, 
tsopropyl, t-butyl and hexyl, r is in the range of 1 to 20, and aa represents the amino acid side chain of a naturally 
occurring amino acid. 
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51. An intact ionized mass-modified nucleic acid molecule, comprising: 

at least one mass modified nucleotide, wherein the molecule is positively charged, and comprises a member 
selected from the group consisting of: a mass-modified universal primer and a mass-modified initiator oligo- 
nucleotide. 

52. An ionized mass-modified nucleic acid molecule of claim 51 , wherein a mass-modifying functionality (M) is attached 
to at least one sugar moiety of a S'-terminal nucleotide of the primer, and wherein the mass-modifying function (M) 
is a linking functionality (L). 

53. An intact positively charged ionized mass-modified nucleic acid molecule, comprising: 

at least one mass-modified nucleotide containing a modified heterocyclic base selected from a group consisting 
of a cytosine moiety modified at C-5, a thymine moiety modified at C-5, a thymine moiety modified at the methyl 
group of C-5, a uracil moiety modified at C-5, an adenine moiety modified at C-8, a c 7 -deazaadenine moiety 
modified at C-8, a c 7 -deazaadenlne moiety modified at C-7, a guanine moiety modified at C-8, a c 7 -deaza- 
guanine moiety modified at C-8, a c 7 -deaza guanine moiety modified at C-7, a hypoxanthlne moiety modified 
at C-8, a c 7 -deazahypoxanthlne moiety modified at C-8, and a c 7 -deazahypoxanthine moiety modified at C-7. 

54. An intact positively charged ionized mass-modified nucleic acid molecule, comprising: 

at least one mass-modified nucleotide containing a mass-modifying functionaiity (M) attached to at least one 
sugar moiety of the nucleotide. 

55. An Intact positively charged ionized mass-modified nucleic acid molecule, comprising: 

a mass-modifying functionality (M) attached to at Jeast one sugar moiety of the nucleic acid molecule, wherein 
the sugar is modified at a position selected from the group consisting of an Internal C-2' position, an external 
C-2' position, and an external C-5' position. 

56. An intact ionized mass-modified nucleic acid molecule, comprising at least one mass-modified nucleotide contain- 
ing a mass-modifying functionality (M) incorporated into the molecule, wherein (M) is selected from the group 
consisting of F, CI, Br, I, SKCH^, SKCHg^fCgHs), Si(CH 3 )(C 2 H 5 ) 2 , SffCy-Jsfe, CH 2 F, CHF 2 , and CF 3 . 

57. An intact positively charged ionized mass-modified nucleic acid molecule, comprising: 

at least two mass-modified nucleotides, wherein a mass-modifying functionality (M) incorporated into the at 
least one mass-modified nucleotide is generated from a precursor functionaiity (PF) attached to one or more 
of a nucleic acid primer, a chain-elongating nucleoside triphosphate or a chain-terminating nucleoside triphos- 
phate, and wherein the precursor functionality (PF) is selected from the group consisting of -N 3 , -NH 2 , -SH, 
-NCS, -OCO(CH 2 ),COOH (where r-1-20), -NHCO(CH 2 ) r COOH (where r=1-20), -OS0 2 OH, -OCOfCH^I 
(where r=1 -20), and -OP(0-Alkyl)N(Alky!) 2 . 

58. An intact positively charged ionized mass-modified nucleic acid molecule, comprising: 

two or more mass modified nucleotides selected from the group consisting of a mass-modified 2' deoxy nu- 
cleotide, a mass-modified ^^'-dideoxynucieotide, a mass-modified nucleotide and a mass-modified 3'-deox- 
ynucleotide, wherein the two or more mass-modified nucleotides are different from each other. 

59. An ionized mass-modified nucleic acid molecule, comprising at least one mass modified nucleotide selected from 
the group consisting of a mass-modified 2'-deoxy nucleotide, a mass-modified 2',3'-dideoxynucleotide, a mass- 
modified nucleotide and a mass-modified 3' -deoxy nucleotide, wherein the mass modified nucleic acid molecule 
comprises a modified heterocyclic base selected from a group consisting of modified heterocyclic base Is a c 7 - 
deazaadenine moiety modified at C-8, ac 7 -deazaadenlne moiety modified at C-7, a c 7 -deazaguanine moiety mod- 
ified at C-B, a c 7 -deazaguanine moiety modified at C-7, a hypoxanthine moiety modified at C-8, a c 7 -deazahypox- 
anthine moiety modified at C-8, and a c 7 -deazahypoxanthine moiety modified at C-7. 

60. An ionized intact mass-modified nucleic acid molecule, comprising at least one mass modified nucleotide wherein 
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a mass-modifying functionality (M) incorporated Into the molecule is generated from a precursor functionality (PF) 
attached to one or more of a nucleic acid primer, a chain-elongating nucleoside triphosphate or a chain-terminating 
nucleoside triphosphate, and wherein the precursor functionality (PF) is selected from a group consisting of -N 3 , 
-NH 2 , -SH, -NCS, -OCO(CH 2 ) r COOH (where r=1-20) t -NHCO(CH 2 ) r COOH (where r=1-20), -OS0 2 OH, -OCO 
(CH 2 ) r l (where r=1-20), -CONH 2 , -NH-C(S)-NH 2 , OP(0-Alkyl)OH, and O-CO-CKj-SH. 

61 . An ionized intact mass-modified nucleic acid molecule, comprising at least two mass modified nucleotides, wherein 
a mass-modifying functionality (M) incorporated into the molecule is XR, wherein X is selected from the group 
consisting of -O-, -NH-, -S-, -NHC(S)-, -OCO(CH 2 ) r COO-(where r=1-20), -NHCCKCH^COO- (where r=1-20) t 
-OS0 2 0- and -OP(0-Alkyl)0- and R is selected from the group consisting of H, methyl, ethyl, propyl, Isopropyl, t- 
butyJ, hexyl, benzyl, benzhydryl, halogen, trftyl, substituted trttyl, aryl, substituted aryl, (-NH^H^HCO 
(CH 2 ) r CO-) fI) -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, (-NH(CH 2 ) r CO-) m -NH-(CH 2 ) r CX)OH, (^(CH^CO-)^ 
0-(CH 2 ) r -COOH, -Si(Y) 3 , -(NHCHaaCO-) m -NHCHaaCOOH, (CH 2 CH 2 0) ni -CH 2 CH 2 OH i and 
-(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, Y is a lower alkyl group selected from a group 
consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is in the range of 1 to 20, and aa represents the 
amino acid side chain of a naturally occurring amino acid. 

62. An ionized intact mass-modified nucleic acid molecule, comprising at least two mass modified nucleotides, wherein 
a mass-modifying functionality (M) incorporated into the molecule is XR, wherein X is selected from the group 
consisting of -O-, -NH-, -S-, -NHC(S)-, OCO(CH 2 ) r COO- (where r=1-20), -NHC(O), -CONH-, -NH-C(S)-NH-, -NH- 
CO(CH2) r COO- (where r=1-20), -OS0 2 0-, -OCO-CH 2 -S- and -OP(0-Alkyl)0- and R is selected from the group 
consisting of H, N 3 , methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, benzyl, benzhydryl, halogen, trityi, substituted 
trttyl. aryl, substituted aryl, (CH 2 ) m -CH 2 -OH, (CH 2 ) m -CH 2 -0-Y, (CH 2 CH 2 NH) ra -CH 2 -CH 2 -NH 2 , -(NH(CH 2 ) r NHCO 
(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, -(NH(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, 
-(NH-CHY-CO) m -NH-CHY-COOH, (-OfCH^CO-^-O-fCH^-COOH, -Si(Y) 3 , -(NHCHaaCO-) m -NHCHaaCOOH , 
CH 2 F, CHF 2 , CF 3 , -(CH 2 CH 2 0) m -CH 2 CH 2 OH, and -(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, 
Y is a lower alkyl group selected from a group consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r Is 
in the range of 1 to 20, and aa represents the amino acid side chain of a naturally occurring amino acid. 

63. A set of mass-differentiated tag probes wherein, 

each tag probe in the set comprises a sequence of nucleotides which Is complementary by Watson-Crick 
base pairing to a tag sequence present within at least one set of base-specifically terminated fragments; 
the tag sequences to which each tag probe is complementary are different for each tag probe; 
each tag probe in the set comprises at least one mass-modified nucleotide; and 

the mass-modified nucleotides are not isotopically labeled and have different mass modifications in each tag 
probe. 

64. The set of mass-differentiated tag probes of claim 63, wherein at least one of the mass-modified nucleotides 
comprises a mass-modifying functionality (M) attached to the heterocyclic base. 

65. The set of mass-differentiated tag probes of claim 64, wherein the mass-modified heterocyclic base is selected 
from the group consisting of a cytosine moiety modified at C-5, a thymine moiety modified at C-5, a thymine moiety 
modified at the C-5 methyl group, a uracil moiety modified at C-5. an adenine moiety modified at C-8, a c 7 -dea- 
zaadenine moiety modified at C-8, a, a c 7 -deazaadenine moiety modified at C-7, a guanine moiety modified at C- 
8, a c 7 -deazaguanine moiety modified at C-8, a c 7 -deazaguanine moiety modified at C-7, a hyp ©xanthine moiety 
modified at C-8, a c 7 -deazahypoxanthlne moiety modified at C-8, and a c 7 -deazahypoxanthine moiety modified at 
C-7. 

66. The set of mass-differentiated tag probes of claim 63, wherein at least one of the mass-modified nucleotides 
comprises a mass-modifying functionality (M) attached to the phosphorus atom forming an intern ucleotidic linkage 
of the tag probe. 

67. The set of mass-differentiated tag probes of claim 63, wherein at least one of the mass-modified nucleotides 
comprises a mass-modifying functionality (M) attached to the sugar moiety. 

68. The set of mass-differentiated tag probes of claim 63, wherein at least one of the tag probes further comprises a 
cross-linking group (CL) which allows for covalent binding to the corresponding and complementary tag sequences. 
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69. The set of mass-differentiated tag probes of claim 68, wherein the cross-linking group (CL) Is activated photo- 
chemicaJly and is derived from at least one photoactfvatable group selected from the group consisting of psoralen 
and an ellipticine. 

70. The set of mass-differentiated tag probes of claim 63, wherein at least one of the tag probes is mass-modified with 
a mass-modifying functionality (M) selected from the group consisting of XR, F, CI, Br, I, Si(CH3) 3 , SI(CH 3 )2(C 2 H 5 ), 
Si(CH3)(C2H 5 )2, Si(C 2 H 5 ) 3 , CH 2 F, CHF 2 , and CF 3 , wherein X is selected from the group consisting of -O-, -NH-, 
-S-, -NHC(S)-, -OCO(CH 2 ) r COO- (where r=1 -20), -NHCO(CH 2 ) r COO- (where r=1 -20), -OS0 2 0-, and R is selected 
from the group consisting of H, methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, ben2yl, benzhydryl, halogen, trityl, 
substituted trttyl, aryl, substituted aryl, poryoxymethylene, monoalkylated polyoxymethyiene, a polyethylene (mine, 

(NH(CH2) r NHCO(CH 2 ) r CO-) rn -NH-(CH 2 ) r -NH-CO-(CH 2 ) f -COOH, -(NH(CH 2 ) r CO-) TT1 -NH-(CH 2 ) r -COOH, (O 
(CH 2 )^0-) m -0-(Cr^) r -COOH, -Si(Y)a, -(NHCHaaCOOH), -(OH 2 CH 2 0) ra -CH 2 CH 2 OH, and 
-(CH^^O^-C^CHgO-Y, where m Is in the range of 0 to 200, Y is a lower alky I group selected from a group 
consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is in the range of 1 to 20, and aa represents the 
amino acid side chain of a naturally occurring amino acid. 

71. The set of mass-differentiated tag probes of claim 63, wherein one or more mass-modifying functionalities incor- 
porated into the probes are generated from a precursor functionality (PF) attached to the mass-differentiated tag 
probes, and wherein the precursor functionalities are selected from the group consisting of -N 3 , -NHg, -SH, -NCS, 
-OCO(CH 2 ) r COOH (where r=1-20), -NHCOfCH^COOH (where r=1-20), -OSO 2 0H, -OCO(CH 2 ) r l (where r=1-20), 
-CONHg, -NH-C(S)-NH 2 , OP(0-Alkyl)OH, -OP(0-AJkyi)N(Alkyl) 2 , and 0-CO-CH 2 -SH. 

72. The set of mass -differentiated tag probes of claim 63, wherein the tag probes are mass-modified with a mass- 
modifying functionality (M) selected from the group consisting of XR, F, CI, Br, I, Si(CH 3 ) 3 , Si(CH 3 ) 2 (C 2 H5), Si(CH 3 ) 
(C^s)^ Si(C2H 5 ) 3 , CH 2 F, CHF 2 , and CF 3 , wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC 
(S)-, -OCO(CH 2 ) f COO- (where r=1-20), -NHCO(CH 2 ) r COO- (where r=1-20), -OS0 2 0- and -OP(0-Alkyf)0- and R 
is selected from the group consisting of H, N 3 , methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, benzyl, benzhydryl, 
halogen, trityl, substituted trityl, aryl, substituted aryl, (-NH(CH 2 ) r NHCO 
(CHa^CO-^-NH^CH^-NH-CO-tCH^.-COOH, (NH(CH 2 ) f CO-) m -NH-(CH 2 ) f -COOH, (-0(CH 2 ),CO-) m - 
0-(CH 2 ) r -COOH, Si(Y) 3 , -(NHCHaaCO-) m -NHCHaaCOOH, -(CH 2 CH 2 0) m -CH 2 CH 2 OH, and 
-(CH^HgO^-C^Cr^O-Y, where m is in the range of 0 to 200, Y is a lower alkyl group selected from a group 
consisting of methyl, ethyl, propyl, isopropyl, t-butyi and hexyl, r is in the range of 1 to 20, and aa represents the 
amino acid side chain of a naturally occurring amino acid. 

73. The set of mass-differentiated tag probes of claim 63, wherein the tag probes are mass-modified with a mass- 
modifying functionality (M) selected from the group consisting of XR, F, CI, Br, I, Si(CH 3 ) 3 , Si(CH a ) 2 (C 2 H 5 ), SifCHa) 
(C 2 H5) 2 , SifC^HjJa, CH 2 F, CHF 2 , and CF 3 , wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC 
(S)-, -OCO(CH 2 ) r COO- (where r=1-20), -NHC(O), -CONH-, -NH-C(S)-NH-, -NHCO(CH 2 ) f COO- (where r=1-20), 
-OS0 2 0- t -0-CO-CH 2 -&- and -0P(O-Alkyl)0- and R is selected from the group consisting of H, N 3 , methyl, ethyl, 
propyl, isopropyl, t-butyl, hexyi, benzyl, benzhydryl, halogen, trityl, substituted trityl, aryl, substituted aryl, 
(CH 2 ) m -CH 2 -OH, (CH2) m -CH 2 -0-Y, (CH 2 CH 2 NH) m -CH 2 -CH 2 -NH 2 , -(NH(CH 2 ) r NHCO 
(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO(CH 2 ) r -COOH, -(NH(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, -(NH-CHY-CO) 
m-NH-CHY-COOH, (<KCH 2 )£0-) m -0-(CH 2 ) r -COOH, Si(Y) 3 , -(NHCHaaCO-) m -NHCHaaCOOH, CH 2 F, CHF 2 , 
CF 3 , -(CH 2 CH 2 0) m -CH 2 CH 2 OH, and -(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, Y is a lower 
alkyl group selected from a group consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is in the range 
of 1 to 20, and aa represents the amino acid side chain of a naturally occurring amino acid. 

74. An ionized positively charged intact duplex, comprising a mass- modified tag probe bound to a tag sequence present 
within a base-specifically terminated nucleic acid fragment, wherein the mass-modified tag probe comprises at 
least one mass-modified nucleotide. 

75. An ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
specifically terminated nucleic acid fragment, wherein the mass-modified tag probe comprises at least one mass- 
modified nucleotide selected from the group consisting of a mass-modified nucleotide comprising a mass-modifying 
functionality (M) attached to the heterocyclic base, a mass-modified nucleotide comprising a mass-modifying func- 
tionality (M), which, when incorporated into the tag probe, is attached to the phosphorus atom forming an internu- 
cleotldlc linkage of the tag probe and a mass-modified nucleotide comprising a mass-modifying functionality (M) 
attached to the sugar moiety. 
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76. The Ionized duplex of claim 75, wherein the mass-modified heterocyclic base is selected from the group consisting 
of a cytosine moiety modified at C-5, a thymine moiety modified at C-5, a thymine moiety modified at the C-5 
methyl group, a uracil moiety modified at C-5, an adenine moiety modified at C-8, a c 7 -deazaadenine moiety 
modified at C-8, a, a c 7 -deazaadenine moiety modified at C-7, a guanine moiety modified at C-8, a c 7 -deazaguan ine 
moiety modified at C-8, a c 7 -deazaguanine moiety modified at C-7, a hypoxanthine moiety modified at C-B, a c 7 - 
deazahypoxanthine moiety modified at C-8, and a c7-deazahypoxan thine moiety modified at C-7. 

77. An ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
specifically terminated nucfeic acid fragment, wherein: the mass-modified tag probe comprises at least one mass- 
modified nucleotide; and the tag probe further comprises a cross-linking group (CL) which allows for covalent 
binding to the tag sequence. 

78. The ionized duplex of claim 77, wherein the cross-linking group (CL) is activated photochemicalfy and is derived 
from at least one photoactivatable group selected from the group consisting of psoralen and an elllpticine. 

79. An ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
specifically terminated nucleic acid fragment, wherein: the mass-modified tag probe comprises at least one mass- 
modified nucleotide; and the tag probe is mass-modified with a mass-modifying functionality (M) selected from the 
group consisting of XR, F, CI, Br, I, SKCM^, Si(CH 3 ) 2 (C 2 H 5 ), Si(CH3)(C 2 H 5 )2, Si(C 2 H 5 )3, CH 2 F, CHF 2 , and CF 3> 
wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC(S)-. -OCOfCH^COO- (where r=1-20), 
-NHCO(CH2) r COO- (where r=1-20), -OS0 2 0- and -OP(0-Alkyl)0- and R is selected from the group consisting of 
H, methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, benzyl, benzhydryl, halogen, trityl, substituted trityl, aryl, substi- 
tuted aryl, polyoxymethylene, monoalkylated polyoxymethylene, a polyethylene imine, -(NH(CH 2 ) r NHCO 
(CH 2 ) f CO-) ra -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH I -(NH(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, (0(CH 2 ) r CO-) m - 
0-(CH 2 ),-COOH, -Si(Y) 3 , -(NHCHaaCO) m -HCHaaCOOH, -<CH 2 CH 2 0) m -CH 2 CH 2 OH, and 
-(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, Y is a lower alkyl group selected from a group 
consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is in the range of 1 to 20, and aa represents the 
amino acid side chain of a naturally occurring amino acid. 

80. An Ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
specificaliy terminated nucleic acid fragment, wherein: the mass-modified tag probe comprises at least one mass- 
modified nucleotide; and the tag probe Is mass-modified with a mass -modifying functionality (M) selected from the 
group consisting of XR, F, CI, Br, I, SKCH^, Si(CH 3 ) 2 (C 2 H 5 ) t Si(CH 3 )(C 2 H 5 ) 2 , S\(C^i s ) 3t CH 2 F, CHF 2 , and CF 3 , 
wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC(S)-, -OCOfCH^COO- (where r=1-20), 
-NHCO(CH 2 ) r COO- (where r=1-20), -OS0 2 0- and -OP(0-Alkyi)0- and R is selected from the group consisting of 
H, methyt, ethyl, propyl, Isopropyl, t-butyl, hexyl, benzyl, benzhydryl, halogen, trityl, substituted trityl, aryl, substi- 
tuted aryl, (-NH(CH 2 ) r NHCO(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, (-NH 
(CH 2 ) r CO-) m -NH-(CH2) r -COOH, (-0(CH 2 ) r CO-) ro -0-(CH 2 ) r -COOH, -SI(Y) 3 , (NHCHaaCO-) m -NHCHaaCOOH, 
(CH 2 CH 2 0) m -CH 2 CH 2 OH, and -(CH 2 CH 2 0) m -CH2CH 2 0-Y, where m is in the range of 0 to 200, Y is a lower alkyl 

group selected from a group consisting of methyl, ethyl, propyl, isopropyl, t-butyJ and hexyl, r is in the range of 1 
to 20, and aa represents the amino acid side chain of a naturally occurring amino acid. 

81. An ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
specifically terminated nucleic acid fragment, wherein: the mass-modified tag probe comprises at least one mass- 
modified nucleotide; and the tag probe is mass-modified with a mass-modifying functionality (M) selected from the 
group consisting of XR, F, CI, Br, I, Si(CH 3 ) 3 , Si(CH 3 ) 2 (C 2 H 5 ) l SKCr-y^H.^, Si(C 2 H 5 >3, CH 2 F, CHF 2 , and CF 3 , 
wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC(S)-, -OCO(CH 2 ) r COO- (where r=1-20), 
-NHC(C), -CONH-, -NH-C(S)-NH- i -NHCOfCH^COO- (where r=1-20), -OS0 2 0-, -0-CO-CH 2 -S- and -OP 
(O-Alkyl)O- and R is selected from the group consisting of H, N 3 , methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, 
benzyl, benzhydryl, halogen, trityl, substituted trityl, aryl, substituted aryl, (CH 2 ) ro -CH 2 -OH, (CH 2 ) m -CH 2 -0-Y, 
(CH 2 CH 2 NH) m -CH 2 -CH 2 -NH 2 , -(NHfCH^NHCO^H^^O-^-NH-^H^r-NH-CO-fCHaVCOOH, -<NH 
(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, -(NH-CHY-CO)m-NH-CHY-COOH, (-0(CH 2 ) r CO-) m -0-(CH 2 ) r -COOH, -Si(Y) 3( 
-(NHCHaaCO-) m -NHCHaaCOOH, CH 2 F, CHF 2 , CF 3 , -(CHjCHaO^-CHaCHj.OH, and -(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, 
where m is In the range of 0 to 200, Y is a lower alkyl group selected from a group consisting of methyl, ethyl, 
propyl, isopropyl, t-butyl and hexyl, r is in the range of 1 to 20, and aa represents the amino acid side chain of a 
naturally occurring amino acid. 

82. An Ionized Intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
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specifically terminated nucleic acid fragment, wherein: the mass-modified tag probe comprises at least one mass- 
modified nucleotide; and one or more mass -modifying functionalities (M) incorporated into the tag probe are gen- 
erated from one or more precursor functionalities (PF) attached to the tag probe, and wherein the precursor func- 
tionalities (PF) are selected from the group consisting of-N 3 , -NHj, -SH, -NCS, -OCO(CH 2 ) r COOH (where r=1 -20), 
-NHCO(CH 2 ) f COOH (where r=1-20), -OS0 2 OH, -OCO(CH 2 ) r l (where r=1-20), -OP(0-Alkyl)N(Alkyl) 2 , -CONH^ 
-NH-C(S)-NH 2 . OP(0-Alkyl)OH, and OCO-CH 2 -SH. 

83. An intact positively charged ionized and volatilized mass-modified nucleic acid molecule, comprising at feast one 
mass-modified nucleotide selected from the group consisting of a mass-modified 2*-deoxy nucleotide, a mass- 
modified 2' ,3'-dideoxy nucleotide and a mass-modified 3 -deoxy nucleotide. 

84. An intact ionized and volatilized mass-modified nucleic acid molecule, comprising at least two mass modified 
nucleotides. 

85. A solid support, comprising a linking functionality, linked to a nucleic acid primer via a linking group, L, of the 
primer to form a linkage L-L\ wherein: 

the interaction between L and L* is selectively cleavabJe enzymatically, chemically or physically; 
the primer, which is a primerfor enzymatic synthesis of nucleic acids, comprises a mass-modifying functionality 
(M) that introduces defined mass increments into the oligonucleotide molecule for mass-resolution by mass 
spectrometry, that is not a radiolabel or a fluorescent label, and that is linked directly to the primer, or the 
primer comprises an initiated nucleic acid chain that contains a nucleotide with a mass-modifying functionality 
(M); and 

the linkage L-L", is selected from the group consisting of a photocleavable bond, a bond based on a strong 
electrostatic interaction, a tritylether bond, a 0-benzoylproplonyl group and a levulinyl group. 

86. The solid support of claim 85, wherein: 

the mass-modification is a modification of a sugar moiety, base moiety or phosphate backbone; and 

is a modification of a nucleobase or bases in the chain or in the primer, to the phosphate backbone in the chain 

or in the primer or to a 2*-position of the nucleoside or nucleosides in the chain or in the primer. 

87. A microtiter plate adapted with a functionalized membrane, comprising a solid support and a reversibry linked 
nucleic acid primer in each well. 

88. The solid support according to claim 85, wherein the photocleavable bond of linkage L-L*. is selected from the 
group consisting of a charge transfer complex and a moiety, which forms a stable organic radical upon cleavage. 

89. A solid support having a linking functionality, L\ linked to a primer via a linking group, L, forming a photocleavable 
bond L-L\ wherein the photocleavable bond is selected to be selectively cleaved by ultraviolet laser energy. 

90. The solid support of claim 86, wherein the mass modifying functionality (M) Is attached to a heterocyclic base of 
at least one nucleotide, thereby forming a heterocyclic base-modified nucleotide. 

91. The solid support of claim 85, wherein the mass modifying functionality (M) is attached to a heterocyclic base of 
at least one nucleotide, thereby forming a heterocyclic base-modified nucleotide; and 

the heterocyclic base-modified nucleotide is selected from the group consisting of a cytosine nucleotide 
modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide modified at the C-5 methyl group, a 
uracil nucleotide modified at C-5, an adenine nucleotide modified at C-8 t a c 7 -deazaadinine nucleotide modified 
at C-8, a c 7 -deazaadinlne nucleotide modified at C-7, a guanine nucleotide modified at C-8, a c 7 -deazaguanlne 
nucleotide modified at C-8, a c 7 -deazaguanine nucleotide modified at C-7, a hypoxanthine nucleotide modified at 
C-8, a c 7 -deazahypoxanthin e n ucleotlde modified at C-7, and a c 7 -deazahypoxanthlne nucleotide modified at C-8 . 

92. The solid support of claim 86, wherein the mass-modifying functionality (M) is attached to one or more phosphorous 
atoms of an internucleotidic linkage of the primer or of the primer initiated nucleic acid chain. 

93. The solid support of claim 86, wherein the mass modifying functionality (M) is attached to one or more sugar 
moieties of nucleotides of the primer or primer Initiated nucleic acid chain at least one sugar position selected from 
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the group consisting of an Interna) C-2' position, an external C-2' position, and an external C-5' position. 

The solid support of claim 86, wherein the mass-modifying functionality (M) is attached to the sugar moiety of a 5' 
terminal nucleotide and wherein the mass-modifying function (M) is the linking group (L). 

The solid support of claim 86, comprising a set of base-specifically terminated fragments that comprise a mass 
modifying functionality, wherein the mass modifying functionality (M) is attached to the set of base-specifically 
terminated fragments subsequent to enzymatic synthesis of the base-specifically terminated fragments and prior 
to determining the molecular weight values for the fragments by mass spectrometry. 

The solid support of claim 85 or 86, which is selected from the group consisting of: a bead, capillary, polymeric 
sheet, glass piate, and metal surface. 

The solid support of claim 96, wherein the bead is selected from the group consisting of: a magnetic bead, a 
cellulose bead, polystyrene bead, Controlled Pore Glass (CPG) bead, silica-gel bead, a cross-linked dextran bead 
and an agarose bead. 

A solid support, comprising a linking functionality, L', reversibly linked to a nucleic acid primer via a linking group, 
L, of the primer to form a linkage L-L\ wherein: 

the interaction between L and U is selectively cteavable enzymaticafly, chemically or physically; 
the primer, which is for enzymatic synthesis of nucleic acid molecules, comprises a mass -modifying function- 
ality (M) that introduces defined mass increments into the oligonucleotide molecule for mass-resolution by 
mass spectrometry, that is not a radiolabel and that Is linked directly to the primer, or the primer comprises an 
initiated nucleic acid chain that contains a nucleotide with a mass-modifying functionality (M); and 
the linkage L-L\ is a photocleavable bond or a bond based on a strong electrostatic interaction. 

99. A solid support, comprising a linking functionality, L\ reversibly linked to a nucleic acid primer via a finking group, 
L, of the primer to form a linkage L-L\ wherein: 

30 

the interaction between L and L' is cleavable enzymaticafly, chemically or physically; and 
the primer contains a mass-modifying functionality (M) that is not a radiolabel or a fluorescent label, or the 
primer comprises an initiated nucleic acid chain that contains a nucleoside triphosphate with a mass-modifying 
functionality (M) that is not a radiolabel or a fluorescent label. 

35 

100. A method of sequencing a nucleic acid, comprising: 

a) generating base-specifically terminated nucleic acid fragments from the nucfeic acid to be sequenced; 

b) exposing the base-specifically terminated nucleic acid fragments to a single laser to produce desorbed/ 
40 ionized fragments; 

c) determining the molecular weight value of each des orbed/ionized fragment produced by step (b) by mass 
spectrometry; and 

d) determining the nucleotide sequence by aligning the base-specif icaffy terminated nucleic acid fragments 
according to molecular weight. 

45 

101 .A method of sequencing a nucleic acid, comprising: 

generating base-specifically terminated nucleic acid fragments from the nucleic acid to be sequenced; 
determining the molecular weight value of each base-specifically terminated fragment simultaneously by mass 
50 spectrometry; and 

determining the the nucleotide sequence by aligning the base-specif ica fly terminated fragments according to 
molecular weight. 

102. The method of claim 100 or 101 , wherein the base-specifically terminated fragments are purified before the step 
5S of determining the molecular weight values by mass spectrometry. 

103. The method of claim 1 02, wherein the base-specifically terminated fragments are purified by a method comprising: 
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immobilizing the base-specifically terminated nucleic acid fragments on a solid support; and 
washing out all remaining reactants and by-products. 

1G4.The method of claims 101 or 1 02 wherein a counter- Ion of the phosphate backbone of the base-specrfically ter- 
minated nucleic acid fragments is removed or is exchanged with a second counter-ion. 

105.The method of claim 101 or 102, wherln the molecular weight value of each fragment Is determined by matrix- 
assisted laser desorption/ionlzatlon (MALDI-MS). 
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FIG. I6A 
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FIG.I6E 
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FIG.I6G 
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FIG.I6I 
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FIG.I6K 
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FIG. 21 
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